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PREFACE 


The original version of this book submitted to the David A. 
Wells Prize Committee of Harvard University in 1941 carried the 
subtitle, “The Operational Significance of Economic Theory.” At 
that time most of the material presented was already several years 
old, having been conceived and written primarily in 193 7. Further 
delay in publication has been necessary because of the war, and 
because of the addition of supplementary treatise-like material 
going beyond the original conception of the work as indicated by 
its subtitle. 

Because of the pressure of war work I have not been able to 
do full justice to the literature of the last few years, nor even to 
include all of the developments of my own thinking. Fortunately, 
the passage of time has dealt kindly with the analysis contained 
here, and where it abuts upon the topics treated in Professor 
Hicks’s masterly Value and Capital, the similarity in point of view 
has been reassuring. 

My greatest debt is to Marion Crawford Samuelson whose con- 
tributions have been all too many. The result has been a vast 
mathematical, economic, and stylistic improvement. Without her 
collaboration the book would literally not have been written, and 
no perfunctory uxorial acknowledgment can do justice to her aid. 
Nor can the quaint modern custom of excluding the value of a wife’s 
services from the national income condone her exclusion from the 
title page. 

My thanks for prolonged stimulation over many years must go 
out to Professors Schumpeter, Leontief, and E. B. Wilson, while 
each of a legion of Harvard graduate students has left his mark 
upon what follows. The reader will note my dependence upon the 
sterling contribution to Welfare Economics of Professor Abram 
Bergson. Grateful acknowledgment is made to the Social Science 
Research Council and to the Society of Fellows of Harvard Uni- 
versity for the opportunities they provided for pursuit of inde- 
pendent research, and to the Department of Economics of Harvard 
University for their courteous acceptance of the wartime delays 
in publication. 
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Acknowledgment is made to the editors of Econometrica and 
the Review of Economic Statistics for permission to reproduce parts 
of my previously published articles. Chapters IX and X are taken 
almost entirely from two articles that appeared in Econometrica , 
while part of Chapter XI appeared in the Review of Economic 
Statistics. 


Cambridge, Massachusetts 
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FOUNDATIONS OF ECONOMIC ANALYSIS 
PART I 




CHAPTER I 


INTRODUCTION 

The existence of analogies between central features of various theories 
implies the existence of a general theory which underlies the particular 
theories and unifies them with respect to those central features. This 
fundamental principle of generalization by abstraction was enunci- 
ated by the eminent American mathematician E. H. Moore more 
than thirty years ago. It is the purpose of the pages that follow 
to work out its implications for theoretical and applied economics. 

An economist of very keen intuition would perhaps have sus- 
pected from the beginning that seemingly diverse fields — produc- 
tion economics, consumer’s behavior, international trade, public 
finance, business cycles, income analysis — possess striking formal 
similarities, and that economy of effort would result from analyzing 
these common elements. 

I can make no claim to such initial insight. Only after labori- 
ous work in each of these fields did the realization dawn upon me 
that essentially the same inequalities and theorems appeared again 
and again, and that I was simply proving the same theorems a 
wasteful number of times. 

I was aware, of course, that each field involved interdependent 
unknowns determined by presumably efficacious, independent equi- 
librium conditions — a fact which has always been generally real- 
ized. But, and this leads me to the second fundamental purpose 
of this work, it had not been pointed out to my knowledge that 
there exist formally identical meaningful theorems in these fields, 
each derived by an essentially analogous method. 

This is not surprising since only the smallest fraction of eco- 
nomic writings, theoretical and applied, has been concerned with 
the derivation of operationally meaningful theorems. In part at 
least this has been the result of the bad methodological precon- 
ceptions that economic laws deduced from a priori assumptions 
possessed rigor and validity independently of any empirical human 
behavior. But only a very few economists have gone so far as this. 

3 
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The majority would have been glad to enunciate meaningful 
theorems if any had occurred to them. In fact, the literature 
abounds with false generalization. 

We do not have to dig deep to find examples. Literally 
hundreds of learned papers have been written on the subject of 
utility. Take a little bad psychology, add a dash of bad philosophy 
and ethics, and liberal quantities of bad logic, and any economist 
can prove that the demand curve for a commodity is negatively 
inclined. His instinct is good ; the attempt to derive a meaningful 
useful theorem is commendable — much more so than the innocuous 
position that utility is always maximized because people do what 
they do. How refreshing then is a paper like Slutsky’s 1 which 
attempted, with partial success, to deduce once and for all the 
hypotheses upon price-quantity budget behavior implied in the 
utility approach. 

The economist has consoled himself for his barren results with 
the thought that he was forging tools which would eventually yield 
fruit. The promise is always in the future; we are like highly 
trained athletes who never run a race, and in consequence grow 
stale. It is still too early to determine whether the innovations 
in thought of the last decade will have stemmed the unmistakable 
signs of decadence which were clearly present in economic thought 
prior to 1930. 

By a meaningful theorem 1 mean simply a hypothesis about 
empirical data which could conceivably be refuted, if only under 
ideal conditions. A meaningful theorem may be false. It may 
be valid but of trivial importance. Its validity may be indetermi- 
nate, and practically difficult or impossible to determine. Thus, 
with existing data, it may be impossible to check upon the hy- 
pothesis that the demand for salt is of elasticity —1.0. But it is 
meaningful because under ideal circumstances an experiment could 
be devised whereby one could hope to refute the hypothesis. 

The statement that if demand were inelastic, an increase in 
price would raise total revenue is not a meaningful theorem in this 
sense. It implies no hypothesis — certainly not even that a demand 
exists which is inelastic — and is true simply by definition. It may 
possibly have had a certain “psychological” usefulness in helping 

1 E. Slutsky, “Sulla teoria del biiantio del consumatore,” Giornale degU Economisti, 
LI (1915), 1-26. 
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economists ask the right questions of the facts, but even here I 
have some doubts. 

In this study I attempt to show that there do exist meaningful 
theorems in diverse fields of economic affairs. They are not de- 
duced from thin air or from a priori propositions of universal truth 
and vacuous applicability. They proceed almost wholly from two 
types of very general hypotheses. The first is that the conditions 
of equilibrium are equivalent to the maximization (minimization) 
of some magnitude. Part I deals with this phase of the subject in 
a reasonably exhaustive fashion. 

However, when we leave single economic units, the determina- 
tion of unknowns is found to be unrelated to an extremum position. 
In even the simplest business cycle theories there ^s lacking sym- 
metry in the conditions of equilibrium so that there is no possibility 
of directly reducing the problem to that of a maximum or mini- 
mum. Instead the dynamical properties of the system are speci- 
fied, and the hypothesis is made that the system is in “stable” 
equilibrium or motion. By means of what I have called the Corre- 
spondence Principle between comparative statics and dynamics, 
definite operationally meaningful theorems can be derived from so 
simple a hypothesis. One interested only in fruitful statics must 
study dynamics. 

The empirical validity or fruitfulness of the theorems, of course, 
cannot surpass that of the original hypothesis. Moreover, the 
stability hypothesis has no teleological 2 or normative significance ; 
thus, the stable equilibrium might be at fifty per cent unemploy- 
ment. The plausibility of such a stability hypothesis is suggested 
by the consideration that positions of unstable equilibrium, even 
if they exist, are transient, nonpersistent states, and hence on the 
crudest probability calculation would be observed less frequently 
than stable states. How many times has the reader seen an egg 
standing upon its end? From a formal point of view it is often 
convenient to consider the stability of nonstationary motions. 

In a good deal of Part II the dynamical behavior of systems is 
analyzed for its own sake, regardless of implications for compara- 
tive statics. And in the last chapters of Part I, I have gone beyond 
the original conception of the book to include such subjects as 

2 L. J. Henderson, The Order of Nature (Cambridge, Massachusetts: Harvard 
University Press, 1917). 
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welfare economics. Although the logical content of the theorems 
enunciated here is different, there is an underlying unity of method. 

In the beginning it was hoped that the discussion could be made 
nontechnical. Very quickly it became apparent that such a pro- 
cedure, while possible, would involve a manuscript many times 
the present size. Moreover, I have come to feel that Marshall’s 
dictum that “it seems doubtful whether any one spends his time 
well in reading lengthy translations of economic doctrines into 
mathematics, that have not been made by himself” should be 
exactly reversed. The laborious literary working over of essen- 
tially simple mathematical concepts such as is characteristic of 
much of modern economic theory is not only unrewarding from 
the standpoint c?f advancing the science, but involves as well mental 
gymnastics of a peculiarly depraved type. 

On the other hand, I have attempted to avoid all mathematical 
flourish, and the pure mathematician will recognize all too readily 
the essentially elementary character of the tools used. My own 
interest in mathematics has been secondary and subsequent to my 
interest in economics. Nevertheless, the reader may find some 
parts hard going. To lighten the way I have placed the purely 
mathematical theorems in two separate appendices, of which the 
second gives a reasonably self-contained introduction to the theory 
of difference equations. 


CHAPTER II 


EQUILIBRIUM SYSTEMS AND COMPARATIVE 

STATICS 

Most economic treatises are concerned with either the descrip- 
tion of some part of the world of reality or with the elaboration of 
particular elements abstracted from reality. Implicit in such 
analyses there are certain recognizable formal uniformities, which 
are indeed characteristic of all scientific method. .It is proposed 
here to investigate these common features in the hope of demon- 
strating how it is possible to deduce general principles which can 
serve to unify large sectors of present day economic theory. 

In every problem of economic theory certain variables (quan- 
tities, prices, etc.) are designated as unknowns, in whose determina- 
tion we are interested. Their values emerge as a solution of a 
specified set of relationships imposed upon the unknowns by as- 
sumption or hypothesis. These functional relationships hold as of 
a given environment and milieu. Of course, to designate this 
environment completely would require specification of the whole 
universe; therefore, we assume implicitly a matrix of conditions 
within which our analysis is to take place. 

It is hardly enough, however, to show that under certain condi- 
tions we can name enough relations (equations) to determine the 
values of our unknowns. It is important that our analysis be 
developed in such terms that we are aided in determining how our 
variables change qualitatively or quantitatively with changes in 
explicit data. Thus, we introduce explicitly into our system cer- 
tain data in the form of parameters, which in changing cause shifts 
in our functional relations. The usefulness of our theory emerges 
from the fact that by our analysis we are often able to determine 
the nature of the changes in our unknown variables resulting from 
a designated change in one or more parameters. In fact, our 
theory is meaningless in the operational sense unless it does imply 
some restrictions upon empirically observable quantities, by which 
it could conceivably be refuted. 
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This in brief is the method of comparative statics, meaning by 
this the investigation of changes in a system from one position of 
equilibrium to another without regard to the transitional process 
involved in the adjustment. By equilibrium is meant here only 
the values of variables determined by a set of conditions, and no 
normative connotation attaches to the term. As will be shown 
later, it is always possible to set up completely trivial equilibrium 
systems. 

This method of comparative statics is but one special applica- 
tion of the more general practice of scientific deduction in which 
the behavior of a system (possibly through time) is defined in 
terms of a given set of functional equations and initial conditions. 
Thus, a good cteai of theoretical physics consists of the assumption 
of second order differential equations sufficient in number to de- 
termine the evolution through time of all variables subject to given 
initial conditions of position and velocity. Similarly, in the field 
of economics dynamic systems involving the relationship between 
variables at different points of time (e.g., time derivatives, weighted 
integrals, lag variables, functionals, etc.) have been suggested for 
the purpose of determining the evolution of a set of economic vari- 
ables through time . 1 At a later stage I deal with these dynamic 
problems. 

The concept of an equilibrium system outlined above is ap- 
plicable as well to the case of a single variable as to so-called 
general equilibrium involving thousands of variables. Logically 
the determination of output of a given firm under pure competition 
is precisely the same as the simultaneous determination of thous- 
ands of prices and quantities. In every case ceteris paribus as- 
sumptions must be made. The only difference lies in the fact that 
in the general equilibrium analysis of, let us say, Walras, the con- 
tent of the historical discipline of theoretical economics is prac- 
tically exhausted. The things which are taken as data for that 
system happen to be matters which economists have traditionally 
chosen not to consider as within their province. Among these 
data may be mentioned tastes, technology, the governmental and 
institutional framework, and many others. 

1 R. Frisch, “On the Notion of Equilibrium and Disequilibrium/' Review of Economic 
Studies , III (1936), 100-105. J* Tinbergen, Annual Survey: Suggestions on Quantitative 
Business Cycle Theory, Econometrica, III (1935), 241-308. 
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It is clear, however, that logically there is nothing fundamental 
about the traditional boundaries of economic science. In fact, a 
system may be as broad or as narrow as we please depending upon 
the purpose at hand ; and the data of one system may be the vari- 
ables of a wider system depending upon ^expediency. The fruit- 
fulness of any theory will hinge upon the degree to which factors 
relevant to the particular investigation at hand are brought into 
sharp focus. And if, for the understanding of the business cycle 
a theory of governmental policy is demanded, the economist can 
ill afford to neglect this need on the ground that such matters lie 
outside his province. As for those who argue that special degrees 
of certainty and empirical validity attach to the relations encom- 
passed within the traditional limits of economic theory, we may 
leave to them the task of proving their case. 

It is not to be thought that the content of systems as described 
above must be restricted to the variables usually considered in 
price and value theory. On the contrary, one employs such con- 
structions throughout the whole field of theoretical economics in- 
cluding monetary and business cycle theory, international trade, 
etc. It goes without saying that the existence of such systems in 
no way hinges upon the employment of symbolic or mathematical 
methods. In fact, any sector of economic theory which cannot be 
cast into the mold of such a system must be regarded with suspicion 
as suffering from haziness. 

Within the framework of any system the relationships between 
our variables are strictly those of mutual interdependence. It is 
sterile and misleading to speak of one variable as causing or de- 
termining another. Once the conditions of equilibrium are im- 
posed, all variables are simultaneously determined. Indeed, from 
the standpoint of comparative statics equilibrium is not something 
which is attained; it is something which, if attained, displays 
certain properties. 

The only sense in which the use of the term causation is ad- 
missible is in respect to changes in external data or parameters. 
As a figure of speech, it may be said that changes in these cause 
changes in the variables of our system. An increase in demand, 
i.e., a shift in the demand function due to a change in the data, 
tastes, may be said to cause an increased output to be sold. Even 
here, when several parameters change simultaneously, it is im- 
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possible to speak of causation attributable to each except in respect 
to limiting rates of change (partial derivatives). 

Symbolic Formulation 

All of the above may be stated compactly in mathematical 
form. Given n variables or unknowns (xx, • • • , x s ) and m, greater 
or less than n, parameters (an, • • •, a m ), we assume n independent 
and consistent functional relationships involving our variables and 
parameters. These may be written most generally in implicit 
form, each equation involving all variables and parameters. 

f (Xx, * j &ni CO, , OCm) b, 

* • • *, x n , a u • • •, a m ) = 0, 

: a) 

/ (x x , * , X„ , Oi x , > UJ,„) 0 , 

or more compactly, 

J*(yX 1, * , X„, G!x, ' , b. ( f 1} ’ " ‘ i ») 

Our equations must not be greater than n in number, for if they 
were, they could not be both consistent and independent; if less, 
our system will, in general, be under-determined. If our equations 
possess certain characteristics, to be discussed later, they may be 
considered to determine a unique set of values of our unknowns 
(xi°, • • •, x„°) corresponding to any preassigned set of parameters 

(ax°, • ■ <x m °). 

This functional relationship may be expressed mathematically 
as follows: 

%t — ^(oil, ', Oim ) . ('f ‘ 1| * ' i Tl) (2) 

It should be understood that this does not imply that we can 
express our unknown variables as any elementary functions of the 
parameters (such as polynomial, trigonometric, or logarithmic 
functions). On the contrary, our conditions of equilibrium in set 
(1) will not in general be expressible in terms of a finite number of 
elementary functions; and even if they were, we should not be sure 
that they could be solved explicitly in any simple terms. How- 
ever, this is of no particular importance, for the elementary func- 
tions are only special forms which have been of historical interest 
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in the development of mathematical thought and its physical ap- 
plications. If one were to draw free hand either at random or as a 
result of a complete set of observations a demand curve relating 
price and quantity, this could not in general be more than approxi- 
mately represented by finite combinations of elementary functions. 
And yet it is a perfectly good functional relationship indicating a 
certain correspondence between these variables. Furthermore, if 
there were any practical need for so doing, it could be tabulated 
once and for all, be named, and ever afterwards be accepted as a 
member of the family of respectable functions. 

It is precisely because theoretical economics does not confine 
itself to specific narrow types of functions that it is able to achieve 
wide generality in its initial formulation. Still it is'not to be for- 
gotten that the aim of fruitful inference is the explanation of a wide 
range of phenomena in terms of simple, restrictive hypotheses. 
However, this must be the final result of our research, and there is 
no point in crippling oneself in setting out upon the journey. 

If one were omniscient, i.e., if all the implications of any as- 
sumptions were intuitively obvious, the equations (2) would be 
instantly known as soon as the set (1) were given. Short of such 
powers, they could only be solved for to any required degree of 
approximation with much effort; but, of course, it could be done, 
given the requisite time and patience. 

Considering the amount of labor involved, one is tempted to 
question the advantage of starting from our equilibrium equations 
in (1) . Why not begin directly with the equations in (2) ? Indeed, 
it may be pointed out that these resulting functions between un- 
knowns and parameters could have arisen from an infinity of 
possible alternative sets of original equations. In particular, con- 
sider the set of implicit equations 

‘ . a m) — Xi = 0. (i = 1, • • • , n) (3) 

This can be solved by inspection to give the equations of (2), but 
of course the solution is trivial. It is trivial in the sense that the 
result is intuitively obvious from the beginning, and not for the 
reason that (2) says the same thing as (3). For, after all, this 
equivalence is also true for (1) and (2), but their Identity is not 
trivial in this psychological sense. 

It is important not to be confused in these matters, for they lie 
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at the foundations of scientific deduction, and they have been mis- 
understood particularly often by economists. By deductive rea- 
soning we are enabled only to reveal to ourselves implications 
already included in our assumptions. We may bring to explicit at- 
tention certain formulations of our original assumptions which ad- 
mit of possible refutation (confirmation) by empirical observation. 

This process may best be regarded as a translation of our origi- 
nal hypothesis into a different language; but in so translating— 
provided, of course, that no error in logic has crept in — we do not 
change the nature of our original hypothesis, neither adding nor 
subtracting from its validity and precision. 

The usefulness of the formulation of equilibrium conditions 
from which emerges our solution lies in the fact that by so doing 
we often gain knowledge concerning possible and necessary re- 
sponses of our variables to changes in data. Without such restric- 
tions our theories would be meaningless. Merely to state, as was 
suggested earlier, that there exists a final functional relationship 
between all variables and parameters (as of an infinity of accom- 
panying circumstances) is bare and formal, containing no hy- 
pothesis upon the empirical data. 

It is because in a wide variety of cases we can more or less 
plausibly assume or hypothesize certain properties of our equi- 
librium equations that we are able to deduce with the same degree 
of plausibility certain properties of the resulting functions between 
our unknowns and parameters. For the properties of the functions 
in (2) are necessarily related to the structural characteristics of the 
equilibrium set (1). The properties ordinarily discussed in this 
connection are not specific quantitative restrictions upon the func- 
tions (as being polynomials, etc.), but rather consist of assumptions 
with respect to slope, curvature, monotonicity, etc. ; they are the 
kind implied by the law of diminishing returns. 

Displacement of Equilibrium 

It is easy to show mathematically how the rates of change of 
our unknowns with respect to any parameter, say a 1( may be com- 
puted from our equilibrium equations. As a matter of notation let 

dxf _ dgfa i°, • • • , « ro °) 

Soa] dct\ 


~ gifcii, a* 0 ) 
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stand for the rate of change of the it h variable with respect to the 
parameter a%, all other parameters being held constant. It is 
necessary because of the ambiguity of the conventional notation 
of partial differentiation to make sure which variables are being 
held constant. 

Such partial derivatives must be evaluated as of a given value 
of the set of parameters, and hence as of a corresponding set of 
values of our dependent variables. Consider the initial position 

(«i°» • • ‘ > a™ 0 ), 
and the corresponding set of unknowns 


where of course 


• x ^ 

j >A'n / j 


f(x 1 °, • • • , Xn°, Oil 0 , • * *, aj) = 0, (i = 1, • • • , n) (4) 

and 

Xi° = g^Oi 0 , • • * , aj), (i — 1, * • • , n) (5) 

since our unknowns must satisfy the equilibrium conditions. 
Differentiating each equation of (1) with respect to a h remember- 
ing that all other parameters are to be held constant, but that all 
our unknowns are variable, we get 2 




with all other variables and parameters held constant. Similarly, 


df(x i°, • * - , x n ° , ai°, — i aj) 

dai 


2 In matrix terms this is 
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Be it noted that the numerical values of these partial derivatives 
are fully determined at the equilibrium point in question. Thus, 
we have n linear equations with constant coefficients in n unknowns 
£(dxi/do!i) 0 , • • •, dx»/da:i) 0 ]. The solution values of these will de- 
pend upon the values of the coefficients; thus, the partial deriva- 
tives relating our dependent variables and parameters are deter- 
mined by the structural properties of our equilibrium system. 

Since (6) represents linear equations, their solution for non- 
singular cases may be represented in the familiar determinantal 
form : 


where 



n 


T, fat* Aik 


A 


(k = 1, • • •, n) 


U 1 U 1 ••• u 

f 2 f 2 ... f 2 

Jx x Jx 2 J z n 



U. 


n 



and A a indicates the cofactor of the element of the ith row and the 
&th column. Or in matrix terms 


[IS]" " “ («) 


Illustrative Tax Problem 

For concreteness let us apply our analysis to two simple cases. 
Consider a firm with a given demand curve relating price and out- 
put and a given production cost schedule relating total cost and 
output. Suppose in addition that the output of the firm is subject 
to a tax of t dollars per unit. The profit of the firm may be written 

7 r = total revenue — total production cost — total tax payment 
= xp(x) — C(x ) — tx, 

where 

xp(x) = total revenue in function of output, 

C(x) = lowest total production cost at which each output can 
be produced, 
tx = total tax payment. 
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It is clear that for any given tax rate, say t°, the firm will decide 
upon some given output to be produced and sold; i.e., 


*° = g(f), (9) 

where the functional relationship g corresponds to the functions 
in (2). However, we cannot leave the matter in so indefinite a 
state. We wish to know more than that there exists an equilibrium 
output for each tax rate. What is the nature of the dependence 
of our variable upon the tax rate regarded as a parameter? Will 
an increased unit tax result in a larger or smaller output? It is a 
poor theory indeed which will not answer so simple a question. 
Let us see if we can arrive at an answer to this through the formula- 
tion of the conditions of equilibrium. • 

It is in general assumed that a firm will select that output which 
will maximize its net revenue. That is to say, our equilibrium 
value for output will emerge as a solution of a simple maximum 
problem. Specifically, it is necessary 3 for a regular maximum of 
profit with respect to x, as of a given tax rate-, that 


chrjx, t) _ - 
dx 

d 2 Tr(x, t ) 
dx 2 ^ U 


( 10 ) 


The first condition merely states that at a maximum the tangential 
slope of the profit function plotted against output must be hori- 
zontal or equal algebraically to zero. The second condition in- 
sures that we do not have a minimum. 

For the problem at hand our condition of equilibrium may be 
derived by simple differentiation to become 

y x lxp(x) - C(x)2~t = 0, (11) 

where it is assumed for simplicity that the inequality in (10) is 
realized for all values of the variable concerned. Now equation 
(11) corresponds to our equilibrium set (1), which in this case con- 
tains but one equation due to the fact that we have only the value 
of one unknown to determine. 

3 See Mathematical Appendix A, sec. L 
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For each value of t, there will correspond a root of the resulting 
equation in x, and this will be our equilibrium value. Thus, the 
function (9) may be regarded as the explicit solution of this implicit 
equation. 

Now what have we gained by introducing the maximum prob- 
lem for the firm? Does this enable us to answer our original ques- 
tion as to the nature of the dependence of the output upon the tax 
rate? Let us apply the general method outlined above to calculate 
the rate of change of equilibrium output with respect to the 
parameter t. Differentiating (11) with respect to t, we get 

, 1 , ( 12 ) 

where 



In this simple case no resort to determinants is necessary in order 
to achieve a solution, for 

i? ' (i3) 

0 $ [x°K*°) - C(x 0 )] 

But this provides us with the answer for which we have been seek- 
ing. For as a sufficient condition for a relative maximum we 
know that 

|k*W) - C(x«)] <0. (14) 

Therefore, 

(|f)° < 0, or g'(f) < 0, (15) 

which is what intuition tells us would happen as a result of such a 
tax. Be it noted in passing that it is assumed that the firm is 
always in equilibrium before and after the tax is imposed, and that 
the tax affects the equilibrium only as indicated in equation (11). 
In any actual case considerable attention must be devoted to the 
problem of verifying the correctness of these assumptions before 
any practical application is made of the conclusions reached. 
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Illustrative Market Case 4 

Let us consider as another example a market for a good or 
service where the price and quantity are determined by the inter- 
section of hypothetical supply and demand curves. Furthermore, 
let us introduce a parameter of shift, a, into our demand curve 
(e.g., tastes, tax, rival price, etc.). Here we have two variables, 
one parameter, and two equations to define equilibrium values of 
our variables in terms of the parameter. Mathematically, 

D(x , a) — p = 0, 

S(x) - p = 0. 

As a solution we have 

x° = g x (a 0 ), 

P° = g 2 (a°). 

How then will our variables change with changes in a assuming 
that an increase in a shifts the demand curve upwards and to the 
right? As before, we differentiate our equilibrium relations with 
respect to the parameter to get two linear equations, 

dD/dxy _ /dpy _ _ e>D 

dx \ da ) \da/ da 

By simple substitution we get 

/ dx \° _ 

= 


Now we know that 3D /da > 0 by definition of our parameter of 
shift. Thus, (dx/3a)°$ 0 depending upon whether S' g dD/dx. 
At first sight this seems merely to put off the dilemma, to replace 
one equation by another. But if we examine the kind of market 

4 This is treated in greater detail in chap, ix below. 


3D 

3a 


3D 

dx 


S' 


„3D 

5 da 


3D 

dx 


S' 


(19) 


(16) 

(17) 
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under consideration, it will appear that the mere fact that the 
market is in stable equilibrium in the initial situation will remove 
all ambiguity. For if the market is the familiar Marshallian con- 
sumer’s good market, stability of equilibrium by definition requires 
that the supply curve cut the demand curve from below (even in 
the case of decreasing cost due to external economies). 5 

Thus, 





( 20 ) 


Therefore, 



( 21 ) 


However, the algebraic sign of the price change will hinge upon 
whether there is a positively or negatively inclined supply curve. 
For 



( 22 ) 


Thus, since (dx/da)° is positive from (21), ( dp/da)° and S' must be 
of the same sign, or 



S' > 0. 


( 23 ) 


It is impossible in the price case to get rid of the final ambiguity. 

Suppose, however, that this were a market for a factor of pro- 
duction. Here the conditions for stable equilibrium are familiarly 
defined to be either a positively sloping supply curve with a nega- 
tively sloping demand curve; or if the supply curve is negatively 
inclined, it must be backward rising and steeper than the demand 
curve. 6 Mathematically, our stability conditions may be expressed 



< 0 . 


( 24 ) 


For this case, therefore, the sign of the price change is known, while 
the quantity change is ambiguous, depending upon the algebraic 

5 A, Marshall, Principles of Economics (8th edition), p. 346, n. 1, p. 806, n. 1. 

6 J. R. Hicks, Value and Capital (Oxford, 1939), chap v. 
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sign of the slope of the supply curve, fn brief, 



Innumerable other examples could be cited. In general, we 
should not expect to be able to determine the signs of the rates of 
change of our variables upon the basis of simple a priori qualitative 
restrictions on our equilibrium equations. This is not because of 
the difficulty and complexity of solving a large number of equa- 
tions ; for these could be solved if sufficient were known about the 
particular empirical values of our conditions of equilibrium. It is 
rather that the restrictions imposed by our hypotheses on our 
equilibrium equations (stability and maximum conditions, etc.) 
are not always sufficient to indicate definite restrictions as to alge- 
braic sign of the rates of change of our variables with respect to 
any parameter. 

• Only imagine a change in a parameter which enters into all of 
a large number of equilibrium equations causing them simultane- 
ously to shift. The resulting net effect upon our variables could 
only be calculated as a result of balancing the separate effects 
(regarded as limiting rates of change), and for this purpose detailed 
quantitative values for all the coefficients involved would have 
to be known. 

Summary 

Before going on in the next section of our work to indicate how 
the economist is enabled to deduce significant results in a wide 
category of cases, it will be well to summarize the thread of the 
argument thus far. 

a. For theoretical purposes an economic system consists of a 
designated set of unknowns which are constrained as a condition 
of equilibrium to satisfy an equal number of consistent and inde- 
pendent equations (see equation 1). These are implicitly assumed 
to hold within a certain environment and as of certain data. Some 
parts of these data are introduced as explicit parameters; and, as 
a result of our equilibrium conditions, our unknown variables may 
be expressed in function of these parameters (see equation 2). 
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b. The method of comparative statics consists of the study of 
the responses of our equilibrium unknowns to designated changes 
in parameters; i.e. , we wish to know the properties of the functions 
in (2). In the absence of complete quantitative information con- 
cerning our equilibrium equations, it is hoped to be able to formu- 
late qualitative restrictions on slopes, curvatures, etc., of our equi- 
librium equations so as to be able to derive definite qualitative 
restrictions upon the responses of our system to changes in certain 
parameters. It is the primary purpose of this work to indicate 
how this is possible in a wide range of economic problems. 


CHAPTER III 


THE THEORY OF MAXIMIZING BEHAVIOR 

Three Sources of Meaningful Theorems 

It will be remembered that in the case of a unit tax on a firm’s 
output it was possible to state unambiguously the direction of 
change of output with respect to a change in the tax rate. This 
was so because of the fact that the equilibrium output for each tax 
rate was derived from the condition that profit must be at a 
maximum. 

As will become evident in the course of our exposition, this is 
by no means an accidental, isolated case ; it is merely an application 
of a very general principle of economic method, which lies at the 
bottom of much of economic theory. In fact, aside from those 
parts of economic doctrine whose results are inconclusive — and the 
most ardent advocates of economic theory will admit that this 
includes a large part of accepted analysis — there is not much which 
cannot be brought under this heading . 1 

The general method involved may be very simply stated. In 
cases where the equilibrium values of our variables can be regarded as 
the solutions of an extremum ( maximum or minimum ) problem, it is 
often possible regardless of the number of variables involved to de- 
termine unambiguously the qualitative behavior of our solution values 
in respect to changes of parameters . 2 

It so happens that in a wide number of economic problems it 
is admissible and even mandatory to regard our equilibrium equa- 
tions as maximizing (minimizing) conditions. A large part of en- 
trepreneurial behavior is directed towards maximization of profits 
with certain implications for minimization of expenditure, etc. 

1 The reader may verify this result by paging through any good economics textbook, 
such as Marshall’s Principles , and analyzing the derivation of various theorems stated. 

2 It may be pointed out that this is essentially the method of thermodynamics, 
which can be regarded as a purely deductive science based upon certain postulates 
(notably the First and Second Laws of Thermodynamics). That such abstract reasoning 
should in the hands of Gibbs and others lead to fruitful theorems testifies to the validity 
of the original hypotheses. 
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Moreover, it is possible to derive operationally meaningful restric- 
tive hypotheses on consumers’ demand functions from the assump- 
tion that consumers behave so as to maximize an ordinal preference 
scale of quantities of consumption goods and services. (Of course, 
this does not imply that they behave rationally in any normative 
sense.) 

It is not to be thought that in principle all economic results 
emerge from these maximizing assumptions . 3 For, as we have 
seen, it was also possible to deduce conclusive qualitative results 
from certain stability assumptions. Nevertheless, many of these 
stability conditions rest implicitly upon maximizing behavior. 

Moreover, certain difficulties arise here. While, of course, it is 
always possible to lay down arbitrary definitions of stability, it 
is impossible to deduce them without the implicit introduction of 
dynamical considerations concerning the behavior of a system out 
of stationary equilibrium. Depending upon the dynamical set-up 
envisaged, different stability conditions are implied. Thus, given 
supply adjustments of the type assumed in the cobweb cycle phe- 
nomenon, it is well known that the ordinary Marshallian condition 
of a positively rising supply curve may not result in “stable” 
equilibrium . 4 

It is time that the identification of equilibrium with a stable 
maximum position in a sense begs the question of stability. How- 
ever, where these extremum conditions are realized, it can be shown 
that many dynamic set-ups will give rise to dampened oscillations 
as a result of small displacements. The relations between dynamic 
theory and the stability of equilibrium are discussed in the last 
chapters. 

There remains still another possibility by which conclusive 
theorems may be deduced. We may know in advance certain 
qualitative properties of our equilibrium equations. Thus, refer- 
ence may be made to alleged technological and psychological laws, 

5 Thus, the definition of economic theory as the study of scarce means with alterna- 
tive uses I would regard as too broad from one point of view, and much too narrow 
from another. 

4 A stationary equilibrium is stable, providing the specification of initial conditions 
differing only slightly from the stationary equilibrium values results in an evolution 
which tends (at least in the limit) to approach the equilibrium values. See Part II 
below. 
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held to be plausible upon a priori grounds . 5 Even here, as many 
economists have pointed out, the reasoning may rest in the 
last analysis upon certain maximum considerations. Thus, in 
demonstrating the validity of the law of diminishing marginal phys- 
ical productivity we ordinarily point to the fact that firms in pure 
competition are in equilibrium under a given set of factor prices. 
This would not be possible if the law of diminishing marginal pro- 
ductivity did not hold. Again we point to the fact that farmers 
do not devote all their attention to raising grain on one square inch 
of land; they use much land of the same kind, and even inferior 
grades of land. Thus, we really argue backwards from maximizing 
economic behavior to the underlying physical data consistent 
with it. * 

There is still another type of problem for which the study of 
maximizing behavior is illuminating. In some cases, as we shall 
see later, it is possible to formulate our conditions of equilibrium 
as those of an extremum problem, even though it is admittedly not 
a case of any individual’s behaving in a maximizing manner, just 
as it is often possible in classical dynamics to express the path of 
a particle as one which maximizes (minimizes) some quantity 
despite the fact that the particle is obviously not acting consciously 
or purposively. 

For all of these reasons the study of maximizing behavior affords 
a unified approach to wide areas of current and historical economic 
thought. There is, moreover, considerable advantage in discussing 
the problem at first in its full generality. The high degree of 
abstractness will be more than compensated for in the ease with 
which numerous applications can be deduced as special cases. 

A Calculus of Qualitative Relations 

Before I enter upon the theory of maximizing behavior, it may 
be illuminating to show why intuition and a general feeling for the 
direction of things does not carry one far in the analysis of a com- 
plex many- variable system, such as is characterized by equation 

6 In a problem of any complexity involving a number of variables, intuition is a 
poor guide for the reasons discussed in the next section. All presumptions become 
dubious. In such cases the economist often falls prey to the perils of assuming the 
equiprobability of the unknown. As a result, every reformulation of the problem results 
in changed presumptions. This is no doubt one reason why every terminological 
revolution in economic thought brings with it a recasting of belief. 
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(1) of the previous chapter. To make matters even simpler than 
they usually are in reality, let us suppose that we know the quali- 
tative direction of movement of each of our equilibrium equations 
with respect to a change in all of the variables and parameters. 
Thus, we know at least the algebraic sign of each of the first partial 
derivatives of the form f x . i or fj. It does not matter how we came 
about such knowledge; for example, we might confidently believe 
that individuals will divide an additional dollar of income frac- 
tionally between consumption and saving, not in consequence of 
some economic theory of maximization, but simply as a result of 
everyday observation. 

What then can we say concerning the direction of change of any 
particular variable in response to a change in some parameter? 
According to equation (7) of chapter ii, this is given by a compli- 
cated expression which can be written as the quotient of two n by 
n determinants, whose elements are made up of these first partial 
derivatives. Now in order for our question to have an immediate 
definite answer, we must be able to determine unambiguously the 
signs of each of these determinants. From the original funda- 
mental definition of a determinant, each consists of n ! terms, each 
term involving the product of n elements. From our hypothesized 
knowledge, we can be sure of the sign but not the magnitude of each 
term. Only if all of the n ! terms happen to be of the same sign 
will the sign of the determinant be unambiguous. If our system 
involves ten variables, the determinants will have more than three 
million terms. Regarded simply as a problem in probability, the 
chance that a run of this length should always have one sign is 
about one out of one with a million zeros after it. Therefore, unless 
some special feature is present, we are almost sure to find it neces- 
sary to weigh the magnitude of some terms against others, which 
is to enter upon quantitative problems admitting no answer by 
qualitative methods. 

Nevertheless, let us see how far qualitative methods will take 
us in the slightly simpler case where only one parameter is changed 
in such a manner as to alter only one of the equilibrium relations. 
As before, the signs of all first partial derivatives are known. Con- 
sideration of all possible cases that may arise will show that there 
is no loss in generality in assuming that it is the first equation which 
is shifted, that the partial derivative of each implicit equation with 
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respect to its own variable /*/ is always positive, and that the 
change in the first equation with respect to the given parameter is 
positive. If these conditions are not realized, they can be brought 
about by changes in sign of one or more of the equations or variables 
and the parameter. 

By out qualitative knowledge we are able to specify a matrix 
of the form 


+ 

zb 

zb • * • 

zb 

=t 

+ 

zb • • • 

dz 

zb 

zb 

+ ••• 

zb 


sign// = 


( 1 ) 


l± ± ± ••• +j 

where the n columns represent (x'i, ■ • • , x K ) , and the n rows repre- 
sent (J 1 , •••,/"). The sign in each element will represent the as- 
sumed known sign of the partial derivative of the variable corre- 
sponding to that row taken with respect to changes in the variable 
corresponding to that column. By our previous convention, we 
have made all diagonal elements plus. All the remaining elements 
can be of either sign, but in any one case they are assumed to be of 
definite known signs. All in all, there are a vast number of possible 
matrices of this kind which could arise in any problem, in fact ail < 
together 2 raised to the n(n — 1) power. 

We are interested in the direction of change of our unknowns, 
or in the signs of ( dxi/da , • • •, dx n /da). If nothing is known a 
priori, these may take on any one of all the possible 2* arrange- 
ments of signs. 


+ 

+ 

n 

+ !••• 

+ 1 

— 

+ 

+ ••• 

+ 

+ 

— 

+ ••• 

+ 

— 

— 

+ ••• 

+ 


2 ™ ( 2 ) 


+ ~ 


How many of these can now be eliminated as inadmissible on the 
basis of our qualitative knowledge as embodied in the specification. 
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of the matrix (1)? Ideally, we should hope to be able to rule out 
all but one of the possible combinations so as to get a unique answer. 
However, it would be at least desirable to be able to determine the 
sign of at least one of our unknowns, or, what comes to the same 
thing, to be able to rule out a particular half of all the possible 
combinations. 

So much for our aspirations ; turning to the exact procedure by 
which we rule out a combination, we find ourselves in for dis- 
appointment. If we substitute into equation (6) of chapter ii the 
signs of the indicated elements, we shall be able to rule out a 
combination if, and only if, it leads to a contradiction, i.e., if it does 
not add up to zero, or to a negative number as it should. 

Concretely, this can be seen to mean that we can rule out any 
one of the above combinations of sign in (2) which exactly dupli- 
cates any of the last in — 1) rows of (1), or which is the exact 
antithesis of any of these same rows. Otherwise, one of the last 
(n — 1) equations of (6), chapter ii, could not add up to zero as 
required by the present hypothesis. In addition, we may rule out 
any combination of (2) which exactly duplicates the first row; but 
we are not able to rule out its antithesis. 

AH told then, we are at most able to rule out by qualitative 
considerations only (In — 1) combinations out of a grand total of 
2" possible combinations. Even for moderate sizes of n this leaves 
us with all but a minute fraction of possibilities. And this is the 
largest number that we can rule out on such a hypothesis. In 
many cases we are not able"'to rule out even this many. Thus, if 
any two rows in our original matrix have exactly the same signs, 
they will both rule out the same combinations, which cannot there- 
fore be added together for fear of double counting. 

It can be seen then that purely qualitative considerations can- 
not take us very far as soon as the simple cases are left behind. 
Of course, if we are willing to make more rigid assumptions, either 
of a qualitative or quantitative kind, we may be able to improve 
matters somewhat. Ordinarily, the economist is not in possession 
of exact quantitative knowledge of the partial derivatives of his 
equilibrium conditions. None the less, if he is a good applied 
economist, he may have definite notions concerning the relative 
importance of different effects; the better his judgment in these 
matters, the better an economist he will be. These notions, which 
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are anything but a priori in their original derivation, may suggest 
to him the advisability of neglecting completely certain effects as 
being of a second order of magnitude. In other words, zeros are 
inserted into the matrix of (1). In fact, the so-called method of 
partial equilibrium consists of nothing more than a liberal sprinkling 
of zeros into the equations of general equilibrium. In the hands 
of a master practitioner, the method will yield useful results; if not 
handled with caution and delicacy, it can easily yield nonsensical 
conclusions. 

By simple extensions of the above argument we can show how 
the presence of a zero in any row permits that row to rule out four 
instead of only two combinations. Similarly, r zeros,in a row per- 
mits that row to eliminate 2 r+1 combinations. As before, there 
may be duplications in the eliminating influence of different rows. 
It is also apparent that definite knowledge concerning the sign of 
any one of the variables will enable us to rule out one-half of the 
original number of combinations, or 2” -1 combinations. Because 
of duplicating effects, definite knowledge of the signs of the changes 
in two of the unknowns will enable us to eliminate less than twice 
as many as one unknown; actually, by two known signs we can 
eliminate 3(2"~ 2 ) combinations in all. Definite knowledge of the 
signs of k unknowns will permit the elimination of all but 2 n ~ k 
combinations. 

It would be possible to illustrate the above calculus of quali- 
tative relations by reference to a number of well-known economic 
problems. Space will permit brief mention of a few. In the 
market illustration of the previous chapter 6 there was considered 
the case of a simple Marshallian partial equilibrium market in- 
volving two unknowns, price and quantity, whose equilibrium 
values are determined by the intersection of supply and demand 
schedules. We may apply our analysis to the determination of the 
changes in our variables resulting in an assumed shift of a nega- 
tively inclined demand curve. If the supply curve is known to be 
positively inclined, the sign matrix can be written in the form 

[- 

6 Page 17. 


+ ' 
+ . 


( 3 ) 
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In this case, we are able to rule out the maximum number of com- 
binations, (2 n — 1), or three in ail. Since there are only four 
combinations, we are left with a unique answer, as is apparent from 
the algebraic analysis of the previous chapter. If the supply curve 
is assumed to be negatively inclined, the signs in the second row 
both become plus, and we are able to rule out only the minimum 
number possible, or two. We are left then with a final ambiguity 
which cannot be settled except by quantitative knowledge or by 
means of various stability hypotheses. 

A more illuminating and more difficult example is that of the 
simplified Keynesian system, described in greater detail in chapter 
ix. Without going into details here, it can be stated that this gives 
rise to a system of three variables, interest rate, Xi, income, x%, and 
investment, x s , whose equilibrium values are determined by three 
equations, liquidity preference, f 1 , marginal efficiency schedule, p, 
and propensity to consume, f s . Making the usual assumptions 
concerning the signs of the various first order effects, e.g., that 
investment varies inversely with the rate of interest and that the 
effect of interest on consumption is of an order of magnitude which 
can be neglected, we end up with a sign matrix of the form 


(sign//') 


+ - 0 

- + - 

0 - + 


( 4 ) 


We may now consider a shift in any one of the schedules, e.g., 
a change in the liquidity preference schedule brought about by 
some policy measure. If we apply the above calculus, we are still 
left with three possible combinations for (sign change in Xi), 
namely, (+ + +), (— + +),or(— — — ). In the case of a shift 
of the marginal efficiency schedule, we narrow the choice down to 
two, (+ + +) or (— — — ). For a shift in the propensity to 
consume schedule we end up with three choices, (+ + +), 
(+ 4 )> or ( — ~ — )• 

It will be noted that in none of the cases are we able to make a 
definite statement about even one of the variables. In the second 
case, which is the most favorable, we can only say that an increase 
in the marginal efficiency schedule will either raise interest, income, 
and investment, or else lower all three of them. This is a highly 
unsatisfactory state of affairs, particularly since it seems on the 
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face of it doubtful that an increase in the marginal efficiency of 
capital should lower interest, income, and investment. But it is 
only by bringing in the stability considerations of later chapters 
that a more definite, and more reasonable deduction can be made. 
In the discussion in chapter ix it is shown that stability hypotheses 
will leave us with only one combination for the first case, ( — + +); 
for the second case, ( -f- H — {-) ; and for the third case the two possi- 
bilities, (H — f- +) or (+ H ). This last ambiguity is irremov- 

able unless still stronger quantitative assumptions are made. 

Numerous other economic examples could be mentioned, but 
these may be left to the reader. 7 Before returning to the problem 
of maximizing behavior, I should like to call attention to the fact 
that the qualitative calculus is not invariant under transformation 
of the variables. 

Maximum Conditions of Equilibrium 

Let us consider a new variable, s, defined by a single-valued 
function of our previous variables, 

3 1 , , Xny ^ 1 * i &7n) 1 

where f and its partial derivatives of at least second -order exist and 
are continuous in a wide region. If for any preassigned values of 
the a’s, there exists a set of values for the x’s, (,ri°, • • - , x n °), corre- 
sponding to which z is at a maximum, then we must have 

f(x 1 , • • • , x n , ai°, ■ • • , aj) S/(x i°, • • • , x„°, on 0 , ■ • • , a m °). (6) 

As a matter of notational convenience this may be written 

f(X,a°)^f(X 0 ,a°), (7) 

7 One last illustration taken from the field of international trade may be dealt with 
briefly. Professor Leontief has produced a numerical example illustrating the possibility 
that a unilateral payment from one country to another may so shift the terms of trade 
in favor of the paying country as to cause it to be better rather than worse off as a result 
of the transfer. “Note on the Pure Theory of Transfer/’ in Explorations in Economics 
(New York, 1936), pp. 84-92. The example is carefully framed so as to guarantee 
indifference curves of the proper curvature for both countries. However, if one sets up 
an analytical system, along the lines of the numerical example, one will find that the 
Leontief Effect can only happen for a system in which an increase in demand for a com- 
modity lowers rather than raises its price. If the latter phenomenon is ruled out as being 
anomalous, or incompatible with stability (defined arbitrarily or in terms of a dynamic 
set up), then we can by the same action rule out the possibility of the Leontief Effect . 
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where X stands for the arguments (* 1 , • • •, x„), and a stands for 
the arguments (a,, • • •, a m ). 

If s° represents an absolute maximum with respect to all ad- 
missible values of our independent variables, then 

f(X, <x°) < f(X°, J>). (8) 

On the other hand, z° may merely represent a maximum relative to 
all x’s lying in some restricted neighborhood of the point (X°, a 0 ). 

We know from Mathematical Appendix A, Section II, that in 
order for z° to enjoy a relative maximum it is necessary that 


dz 

dxi 


0 = fx t (,x i°. • • • . «1° 


a, 


O') 

* 


(i =!,•••,«) (9) 


and 

EEWMiS 0, (10) 

1 1 

where the h's are arbitrary numbers. For a regular relative maxi- 
mum the last condition can be written 


£ LU'hih; < 0, (11) 

1 1 

not all h’s equal to zero. In other words, this symmetrical homo- 
geneous quadratic form must be negative definite. (See Mathe- 
matical Appendix A, Section II.) 


Displacement of Equilibrium 

The set of equations in (9) can be regarded as our conditions of 
equilibrium corresponding to the set (1) of the second chapter, and 
may be assumed to yield an explicit solution for our unknown equi- 
librium values in terms of the preassigned parameters. 8 

Xi° = g l (ai 0 , • • •, a m °). (i — 1, •••,») (12) 

Using the method of the previous chapter we may easily solve 
for the rates of change of our solution values with respect to the 

8 From the implicit function theorem it is known that assuming the definiteness of 
tie quadratic form throughout the whole region under discussion will insure us of the 
uniqueness of our relative maximum position. Also our maximum conditions are essen- 
tially invariant under any non-singular transformation of variables. Furthermore, any 
properly monotonic function of z enjoys an extremum position for the same values of the 
arguments as does z. 
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kth. parameter by means of the following: 


f / 0 dx i _ 

r hiXi da k 

— f 0 

J X i a k 9 

(i = 1, • 


(13) 

where of course, 





- a i( a 0 . 

da k g “* ^ 1 ’ 

i ) • 

rH rH 

II II 

•••»») 

- • • , m) 

(14) 


As in (8) of chapter ii, our solution may be written in the de- 
terminantal form 


where 


dXj 

doth 


_ , 


f 0 
J* 1*1 


• • • f 0 

J*l x n 


H= ■ 


\u 


(15) 


(16) 


■f 0 ... f 0 

JVl JVn 


and Ha is the cofactor corresponding to the element of the fth row 
and the jth. column of the Hessian H. As will appear later, we 
are able in a large class of cases to evaluate the algebraic sign of 
this expression. In what follows where there is no risk of am- 
biguity, I shall omit the superscript zero. 

First let us derive a relationship of complete generality. Mul- 
tiply the ith. equation in (13) by d%i/ da t: respectively to get 


dXj 

da k i <Xi da k 


— fx, 


dXj 

dak 


(» = 1. ••*.») 


(17) 


Summing with respect to i over all equations we have 


” " doci dXj " , dxi 

T i JxiXi da k da k T 7 *** da k ' 

But from (11) 

S £ fxixhihj < 0, 

1 1 

holding for any h’s. In particular, for 


( 18 ) 
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we get 


or from (18) 


z 
1 1 


dXj dXj 
dak dak 


< 0, 



dxi 

da k 


> o, 


( 20 ) 

( 21 ) 


for not all dxi/dau equal to zero. In words this compound term 
consisting of the weighted sum of our unknown rates of change 
must definitely be positive in sign. However, this does not add 
very much to our knowledge since we do not know which terms 
will be positive. 

As mentioned earlier, however, we are not always interested in 
parameters vfhich shift each and every one of our equilibrium con- 
ditions. For this would necessitate a knowledge of the relative 
quantitative importance of each shift before we could hope to 
evaluate the composite result. For this reason we often narrow 
our problem by considering parameters which cause only one of 
our equilibrium equations to shift. Let us restrict our attention, 
therefore, to a set of such parameters (aq, • • • , a n ) equal in number 
to our unknowns. We may number each in the order correspond- 
ing to the respective equilibrium equation which it shifts. Then 
since a change in the /eth parameter must leave all other equations 
unchanged, we have 

/*,«* = 0, for j 9 * k. (22) 


Now our inequality of (21) reduces to the more simple and more 
readily applicable condition 


'St 


dXk 


*k«k 


daie 


> 0. 


( 23 ) 


In words, for the class of parameters now under consideration the 
rate of change of the kth variable with respect to its corresponding- 
parameter must be of the same sign as / Xjk « t , which in turn is positive 
if the shift of the equilibrium equation is in the direction of an 
increase in x k . 

This can be verified by the computation indicated in (15). 
According to our present hypothesis 
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In Mathematical Appendix A, Section III, it is shown that 

< 0, (6 = 1 ,•••,») (25) 

as a condition for a true maximum. Hence, 



Let us examine more closely the nature of our hypothesis em- 
bodied in (22) that each parameter shifts but one condition of 
equilibrium, leaving all others unchanged. In the first place this 
does not mean that a change in the ith. parameter results in a 
change in the ith variable alone. On the contrary, a change in 
any parameter will typically result in a change in all variables. 
Our hypothesis merely says that this must come about through a 
shift in but one schedule with movements along the remaining 
schedules. 

As I shall later argue, the assumption of this hypothesis does 
not involve any serious loss of generality and still includes the vast 
majority (in fact, it is hard to find exceptions) of relationships 
contained in current economic theory. 

The most general function for which the partial differential 
equations of (22) hold may be written 


z = 6(x i, ••■,*») + B\x i, on) + B 2 (x 2 , «*)+"• + B n (x n , a„). 
This may be verified by successive differentiation to obtain 
d 2 z . d 2 B i 


dXjdai 


— fxja k — 


dXjdak 


= 0, for j 7 * k. 


(28) 

(29) 


For the rest of this chapter unless otherwise indicated I shall 
consider functions of this restricted type. The inequality of (21) 
still holds in any case, but is less immediately applicable. After 
all, generality is not an end in itself. A theory may be so general 
as to be useless. It is for simple theories which have wide appli- 
cability that we must look. 
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Displacement of Quantity Maximized 

We have seen how the equilibrium quantities (# 1 , •••,x n ) 
change when parameters vary. Any function of them also varies 
in a determinable way. In particular, the quantity to be maxim- 
ized, z, will change; and its laws of change take on a very simple 
form. Let 

Z = f(x i, • • •, X n , a), (30) 


where a is any parameter. Let the x’s be functions of a determined 
by a maximum position, i.e., by 


dz 

dXi 


Then 


fi(x i, • • •, x n , a) = 0. (i = 1, ••■,«) 


dz i , dXi , „ i f _ dz 

d~a = ¥ i d^ +Ja -° +fa -^ 


(31) 

(32) 


That is, the first-order change in z is exactly equal to the change 
in z when the x’s are not varying optimally so as to keep z at a 
maximum ; only to terms of a higher order is there a difference in 
the way z is changing. A similar relation can be shown to hold 
in the case of a constrained maximum. 

This is the familiar relation of tangency between the envelope 
of a family of curves and the curves which it touches. It has many 
economic applications of which only a few need be mentioned. In 
the famous dispute between Professor Viner and his draftsman, 
Dr. Wong, the question arose as to the correct relationship between 
long and short-run curves. 9 The short-run curve is drawn up by 
minimizing total (and average) costs for each and every output, as 
of given amounts of some fixed factor. The long-run curve requires 
lowest total (and unit) costs for every output, as plant is optimally 
adjusted. But according to our theorem a change in the parameter 
output will to the first-order result in the same change in total (and 
unit) costs when plant is fixed as when plant is optimally adjusted. 
Dr. Wong was right then in insisting upon tangency. 

Another example is provided by Professor Viner. 10 Let labor- 
marginal cost be the rate of increase of total costs when only labor 

9 J. Viner, “Cost Curves and Supply Curves,” Zeitschfift fur Nationalokonomie, HI 
(1932), 23-46. 

10 J. Viner, Studies in the Theory of International Trade (New York: Harper, 1937), 
515-516. 
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is varying so as to produce the extra output. It is to be dis- 
tinguished from marginal cost of labor which appears in monopsony 
theory, and the marginal labor cost which means the increase in 
one component of costs as all factors are optimally varying. Let 
marginal cost be the rate of increase of total costs with respect to 
output as all factors vary optimally. Then our theorem states 
that labor-marginal cost equals marginal cost equals any other 
factor-marginal cost. As Professor Viner has pointed out with 
great insight, at the margin (i.e., neglecting differential coefficients 
of higher-orders) all factors are perfectly indifferent substitutes. 
The classical economists, lacking the precise notion of an infini- 
tesimal, were forced to employ the concept of a broad extensive 
margin. (Viz., Ricardo’s no-rent land, and J. B. Clark’s famous 
zone of indifference.) 11 

For finite movements, however small, higher-order terms will 
make the change in the quantity maximized (minimized) different 
when the unknowns are optimally adjusted from the change when 
they are held constant. Actually, 


d 2 z _ * , d 2 Xi “ dxi d{ft) . " . dxi 
da 2 ~ i Ji do? Y da da ' Z t Jia da 

= 0 + 0 + Ef ia ^+U, 


( 33 ) 


since /, = 0. 

The higher-order change in z when all x’s are held constant is 
given by 


dh 

da 2 


j oca' 


( 34 ) 


The difference between the former change and the latter is as 
follows : 


d?z , dxi 

dZ 2 ~ \ Ua da 


( 35 ) 


because of equation (21) above. 

If the changing parameter affects profits or ordinal utility ad- 
versely, it does so less when output and consumption are optimally 
adjusted to the new circumstances. If the parameter improves 

n For still other examples see Marshall’s Principles , Mathematical Appendix, 
Note XIV, pp. 846-852. 
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profits (etc.), it does so most when the unknowns are optimally 
adjusted. 

One could go on to still higher terms. These would be seen to 
depend upon the various partial derivatives of/ and upon dx { /d<x, 
d 2 Xi/da 2 , etc. The changes in z are of higher order than the changes 
in the x’s. In fact, the rath derivative of z depends at most upon 
the (ra — l)th derivative of the x’s. This is brought out in the 
above equations. 

Auxiliary Constraints and the Generalized 
Le Chatelier Principle 

If the equilibrium of a system is determined by extremum con- 
ditions where r all unknowns are independently variable, the addi- 
tion of auxiliary constraints (satisfied by the equilibrium position) 
will leave the equilibrium unchanged. If a true relative maximum 
is attained, a movement in any direction leads downhill; a fortiori 
movements along certain subsets of directions will be downhill. 
The usefulness of what may at first seem a strange process lies in 
the fact that it enables us to deduce necessary conditions which the 
equilibrium must satisfy. 

Thus, with plant fixed a variable factor will be hired until its 
wage (marginal cost) equals its marginal value productivity. In 
the long-run, plant cannot be taken as fixed. Nevertheless, the 
short-run condition holds in the long-run as well (but not vice 
versa) since long-run total costs cannot be at a minimum unless 
short-run total costs are as low as possible. Again, in discriminat- 
ing monopoly the condition that any given output be divided 
optimally between two markets (equal marginal revenues) holds 
even when total output is not given but is determined by cost 
considerations. 

How is the equilibrium displaced when there are no auxiliary 
constraints as compared to the case when constraints are imposed? 
How does the demand for a factor vary with its price when other 
factors cannot be optimally adjusted because of time lags, etc.? 
This type of question is important in thermodynamics as well as 
in economic systems. It admits of a simple answer. 

Let z be of the following special form 

% ^(xj, * * * , Xn ) «iXi a^xz ■ • * — gj„x,j . (36) 
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If all unknowns are independently variable, a maximum is deter- 
mined by the equilibrium conditions 


^ = 8i(x i°, • • •, x n ° ) - on = 0, (i = 1, • • •, n) (37) 

and 

m = 

is the matrix of a negative definite form. How does x* change when 
its “conjugate” parameter a,- varies? 


dx 2 
dai 


Hu 

H 


< 0. 


(38) 


Let r independent additional linear constraints be hn posed such 
that 


E ~ Xj°) = 0, (0 = 1, •••, r) (39) 

i 

where 

[g/] 

is of rank r. 

The introduction of constraints requires that our equilibrium 
system be modified to assume the following form 


di + E Rfig-P - on - 0, (i = 1, • • • , ») 

i 

E g/fe - Xj°) = 0, (0 = 1, • • r) 

1 


(40) 


where the X’s are Lagrangean undetermined multipliers. 

Let r H represent the determinant formed by bordering H with r 
rows and columns consisting of the coefficients g/; i.e., 


t H = 


M il 
gf o 


(0, y = 1, • - •, r) 


(41) 


Then the change in %i with respect to when r auxiliary con- 
straints are imposed, is given by 



if [H~] is the matrix of a negative definite form, and the matrix 
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~ is negative semi-definite. (Cf. Mathematical Appendix A, 

da.j 

Section IV.) Adopting the convention that 

H = °H, 


equation (38) fits in as a special case of (42). 

What is the effect upon the rate of change of with respect to 
cti of adding an additional constraint? Clearly 


/ dxj \ f dxj\ 

\ dcii J t \ d&i ) t — 1 


rR r-l H 

d-Iji VTj+r, 

T H r Hn+r t iz+r 

Hii n~f~r di H n j r r,n-^-r.ii 

T H r Hn-\-r, n+r 

~ m.» +f ) 2 

r H r H, 


r, 71+ r 


(43) 


by a well-known theorem on determinants. 

The denominator is negative because adjacent bordered prin- 
cipal minors must be of opposite sign. 12 Hence, the difference is 
positive. 

We have the following general theorem : 


( fj^A «= ( < 

\dai) 0 = \d<Xi)i = 



(44) 


While the change in an x with respect to its own parameter is 
always negative regardless of the number of constraints, it is most 
negative when there are no constraints, only less so when there is 
a single constraint, and so forth, until the number of auxiliary 
constraints reaches the maximum possible, namely (n — l). 13 

This explains why economically long-run demands are more 
elastic than those in the short-run. A lengthening of the time 

12 See Mathematical Appendix A, equation (48). It should be noted that our r 
corresponds to m there, and r there corresponds to n here. 

13 This is a purely mathematical theorem. It corresponds to some of the phenomena 
which fall under the heading of the celebrated principle of Le Cha teller. Because of the 
almost metaphysical vagueness of its formulation, the latter’s meaning is often in doubt, 
and it is used at one and the same time to cover diverse phenomena. The above formu- 
lation explains why the change in volume with respect to a given change in pressure is 
greater when temperature is constant than when entropy is held constant and tempera- 
ture is permitted to vary" in accordance with the conditions of equilibrium. 
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period so as to permit new factors to be varied will result in greater 
changes in the factor whose price has changed, regardless of whether 
the factors permitted to vary are complementary or competitive 
with the one whose price has changed. 


Economic Illustrations 


It is not hard to point out the relevance of the above for a large, 
number of economic problems. First, let us repeat the result of 
our previous analysis. 




(45) 


This states that the direction of the change of the «th variable 
with respect to its corresponding parameter is of the same sign as 
fx i a i (= B x . a f). This quantity may be taken as a criterion. If its 
algebraic sign is definite, i.e., unambiguously determinable, then 
the sign of dXi/da,- is also definite. 

We have merely to show, therefore, that a wide variety of eco- 
nomic problems can be so formulated as to yield a conclusive de- 
termination of the sign of our criterion. 

It may be noted that for maximum problems involving a single 
variable no restrictions need be placed upon z in order that our 
criterion be applicable. Let 

z = f(x, a). (46) 

Then our criterion becomes 

/~H> °- <47) 

I list below some random examples chosen to illustrate the 
applicability of the criterion. 

(a) Let us go back to the example of the second chapter of the 
effect of a unit tax on output. It will be remembered that the 
relationship between the equilibrium output and the tax was de- 
termined to be unambiguously negative. This conclusion can be 
quickly obtained by our present method. Profit is defined as 
follows : 

tt — <p(x, t ) = | jcp(x) — C(x) J — tx. (48) 

Our criterion may be easily computed. 


TTxt — tPxt 


1 . 


(49) 
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Therefore, 

T 

Cbi Q,« 

V 

p 

(50) 

or 

CD) Qp 

A 

P 

(51) 

Also, as 

shown in the previous section, 



dr dir 

It ~ ¥ ~ ~ x "’ 



To a first approximation, the change in profits is unaffected by 
the adjustment of output. To a more accurate approximation, 
profits are re'duced less if output is optimally altered. 

(b) The effects of three other kinds of taxes are also simply 
derivable. Consider respectively a percentage tax on gross sales, 
a lump-sum tax, and a percentage tax on profits. The correspond- 
ing profit functions may be written : 

7T = [_xp{x) — C(x)] — t'xp(x), 

7 r = [xp{x) — C(x)~] — t" (52) 

7T = [xp{x) - C(x)] - t m [xp{x) - C(x)J 


Our criteria are respectively 


TTxf 

7T xt" — — 0, 

7T xt r * * 


- C xp{x)l < 0, 


(53) 


chjx, t ,,r ) 

dx 


= 0 . 


Obviously, therefore, for the first case of a tax on gross sales, the 
effect of an increase in the tax rate is to reduce output. The last 
two cases, however, present a new feature. Our criterion is neither 
positive nor negative, but equal to zero. A little reflection reveals 
that our equilibrium maximum conditions are essentially inde- 
pendent of the parameters changed. Hence, our equilibrium 
values remain unchanged, i.e., 

dx 
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These conclusions are, of course, familiar from the Marshallian 
analysis. 

(c) Let us now consider a problem which has nothing to do with 
taxation, but which figured prominently in the famous cost con- 
troversy of a few years back. Let us suppose a firm in pure com- 
petition, i.e., one which can sell as much of its output as it wishes 
without affecting the price. Given total cost in function of output, 
there will be a determinable output reaction to each given price. 
What is the nature of this dependence? From the fact that the 
firm is assumed to be in stable equilibrium when it enjoys a proper 
relative maximum, we can easily apply our criterion to deduce the 
properties of the supply curve. Here 


7T = <p(x , p) ~ pX — C(x), 

(55) 

% - p - C'W = 0. 

(56) 

Equation (56) can be solved to determine output in 

function of 

price. 

(57) 

X = g(p). 

It is easily verified that 


7V xp = <Pxp L 

(58) 

Therefore, 


| " ^ 

(59) 


Even if we relax the requirement that a regular relative maximum 
be realized, it still remains true that the supply curve cannot be 
negatively inclined. Of course, this does not mean that the mar- 
ginal cost curve cannot be negatively inclined, but merely that in 
such ranges it cannot serve as a supply curve. 

(d) It must not be thought, however, that the assumption of 
our equilibrium as the solution of a maximum problem is the “open 
sesame” to the successful unambiguous determination of all possible 
questions which we may ask. For it is extremely easy to specify 
simple and important problems which cannot be answered even 
qualitatively without further knowledge. 

Consider the problem of the effect of the introduction of adver- 
tising expense upon the output of a monopolist. Will an increased 
advertising expense result in a larger or smaller output? Here 

ir = <p(x, a) — R(x, a) — C(x) — a, (60) 
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where 

R(x, a) = the maximum amount of total revenue which can be 
secured for a given output as of a given advertising 
expense optimally directed. 

C(x) = minimum total production cost in function of output. 
a = total advertising expense in dollars. 

For any given a, there is an optimal output which maximizes 
profit. What is the sign of dx/da ? 

Applying our criterion we have 


fxa -Rxay 

(61) 

>3 

gL 

V 

o 

(62) 


Thus, the direction of change of output depends upon the direction 
of shift of the marginal revenue schedule (upwards or downwards) 
as advertising changes. Now there is nothing in the formulation 
of the problem which requires that this shift be of any particular 
direction. Hence, short of quantitative empirical investigation of 
sales responses to advertising no presumption is possible. More- 
over, since there is ambiguity as to the instantaneous rate of direc- 
tion of quantity response to a change in advertising expense, there 
is a fortiori ambiguity as to the effect of a finite change in adver- 
tising expense. It is not possible, therefore, to state whether out- 
put will be larger or smaller under positive advertising expenditures 
as compared to no advertising expenditure. It may be pointed out 
that the effect of advertising upon price is also incapable of un- 
ambiguous inference, as one should intuitively expect from the 
arguments which have been presented on both sides of the case. 

(e) Another problem which has attracted much interest is that 
of whether output will be larger under discriminating monopoly 
or under simple monopoly. Suppose a firm with two markets with 
independent demand curves. 

x t = D'ipi), (i = 1, 2) (63) 

and a schedule of costs in function of total output, 

C — C(x i + Xi). 


(64) 
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Under discriminating monopoly all prices are regarded as inde- 
pendently variable and are adjusted so as to maximize profit. 
Profit may be written as 

x = p 1 D'(J> 1 ) + p 2 D\p 2 ) - C[D\p x ) + D*(Ps)J (65) 

For simplicity we rule out the possibility that all of the demand 
at a given price in a given market may not be satisfied by the 
entrepreneur. The removal of this restriction can be easily made. 
Here the conditions of equilibrium are 

w, = 0 - fl *(« + - C) g)- • 

This results in an optimal set of prices (pi, p 2 °) and quantities 
(xi°, x 2 °), and total quantity (X°), or (xi° -f- xJ). 

In the case of simple monopoly we impose the condition upon 
our problem that the price be equal in both markets. Thus, 

Pi = Pi - P, (67) 

r = pD l (p) + pD\p) - C[D'(p) + D*(p)2. (68) 

|-0 ,D W+ D W + (P-C')(^+^)- (69) 

This yields as a solution (p', p'), (xi, x 2 '), and (X') . Is it possible 
to determine whether 

X' $ Z°? (70) 

At first glance there is little resemblance between this and the 
examples hitherto discussed. No data have been introduced as 
parameters; furthermore, we are not comparing two situations in- 
definitely close to one another. On the contrary, we seem to be 
dealing with two entirely different kinds of behavior. The two 
solutions seem to result from qualitatively different types of maxi- 
mum problems. Nevertheless, we may avail ourselves of an arti- 
fice by means of which we can bring to bear the methods used above. 
Let us introduce a parameter k, defined as follows: 



( 71 ) 
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We may now treat as independent variables (pi, k ) instead of 
(pi, pi) since there is a one to one relationship between the two 
sets. Therefore, 

7T = F(p h pi) = F(p u kpi) = G(p u k). (72) 


In the case of discriminating monopoly both prices are varied inde- 
pendently so as to maximize profit. This is equivalent to the 
condition that both pi and k must be allowed to vary so as to 
maximize profit. Our conditions of equilibrium are 


dir _ dG _ - 
dpi dpi 

«■ dir dG _ _ 

dk = dk = 


(73) 


This must, of course, determine the same solution as given in (66) 
since we have only transformed our variables. 

For simple monopoly it is a preassigned condition of the prob- 
lem that both prices be the same, or that k be equal to unity. 
Thus, by continuous movement of k from unity to palpi', we may 
pass from simple to discriminating monopoly. If then we could 
unambiguously state the direction of the rate of change of output 
with respect to k at each point, it would be possible to determine 
whether output would be larger or smaller. 

In order to ascertain the rate of change of total output with 
respect to k, let us still further transform our variables as follows: 


X = DKPi) + D*(Pi). 

Solving inversely, we get 

Pi = fik, X), 

Pi = f(k, X). 

Thus, 

= F(pi, p^ = FUKk, X), f(k, X)] = v (k, X). 
Our equilibrium condition for a given k is 

dr djpjk, X) 

ax ~ ax ~ u - 


(74) 

(75) 

(76) 


( 77 ) 
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According to our criterion 

«*Sr> o- (78) 

Now the computation of <pxk, while laborious, is nevertheless 
possible. It will be found to depend in a complicated way upon 
the curvatures of our demand curves. Its algebraic sign will be 
ambiguous, as must therefore be the effect upon total output. 
However, anyone who cares to go through the computation can by 
so doing determine what additional restrictions must be placed 
upon the demand functions in order to insure that output be larger 
or smaller. 

(f) In later chapters the methods outlined hen? will be sys- 
tematically applied to different branches of theory. In order to 
illustrate that it is possible to derive unambiguous conclusions even 
with a large number of variables a small part of our later analysis 
will be anticipated here. 

Let us consider the demand of a firm for a factor of production. 
We assume as given a production function embodying the tech- 
nological relationships between inputs and output, the demand 
curve for the finished product, and the prices at which all factors 
of production can be bought in unlimited quantities. The condi- 
tion that profit be at a maximum is sufficient to determine the 
value of all unknowns in terms of these data. Thus, 

7 r = total revenue — total factor cost 
= R(x ) — (w i<h + •+-••• + w n a n ), (79) 

where Wi and a* are the respective prices and quantities of the it h 
factor of production. But we are also given a production function 
(assumed continuous), 

x = x(a i, • • •, &„). (80) 

Hence, 

7T = f(a u ■ • a n , Wi, ■ ■ - ,%v n ) 

= i?[>(ai, • • • , u«)] - (ma-i + • • • + w n a n ). (81) 

Recall from (21) our generalized criterion. 
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Obviously here 
Hence, 


But 

So 


> 

Ill 

p 

(i ^ j) 

ro" 

GO 

/<■<«< > °- (i =!,■■■, n) 

(84) 

f ^ * 

(i =!,•••,») 

(85) 

£<»• (i-i, 

(86) 


This conclusion holds for any number of factors. 

The above examples are but a small sample of economic prob- 
lems which can be regarded as determined by the solution of 
maxima problems. For these, the criteria outlined above may be 
employed to deduce meaningful unambiguous qualitative theorems. 


Analysis of Finite Changes 

Up until now the analysis has been confined almost exclusively 
to the determination of the algebraic signs of instantaneous rates 
of change. One cannot leave the matter here, for in the world of 
real phenomena all changes are necessarily finite, and instantaneous 
rates of change remain only limiting abstractions. It is imperative, 
therefore, that we develop the implications of our analysis for finite 
changes. Fortunately, despite the impression current among many 
economists that the calculus can only be applied to infinitesimal 
movements, this is easily done. 

Let us consider for simplicity a functional relationship between 
one variable, x, and one parameter, a, continuous and twice differ- 
entiable everywhere on a given interval. 

x = g{a). a S a ^ b (87) 

Suppose that we have already ascertained the fact that the alge- 
braic sign of the instantaneous rate of change of this function is 
everywhere negative on the defined interval; i.e., 

else 

- = g'(ce) <0. a (88) 

It follows then that any finite change in a within this interval will 
be accompanied by a finite change in x in the opposite sense. 
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For let 

Aa = a 1 — a 0 , 

Ax — x 1 — x° = g(a l ) 

We wish to prove that 

to 

VII VII 

ft o 

VII VII 

C3 

"So 

1 

(89) 

AxAa < 0. 

Aa 7 ^ 0. 

(90) 

By the theorem of the mean 



Ax = g(a*) - g(a°) = 

■ 1 g\^)da = g'(e)Ao:, 

ct 0 

(91) 

where 

e = a 0 + — 

a 0 ). 0 < d < 1 



But since the derivative g' is negative everywhere on the defined 
interval, it must be negative for x — e. Therefore, Ax and Aa are 
of opposite sign, for 

AxAa = g'(e)( Aa) 2 < 0. (92) 

This result is intuitively obvious. If, starting from a given 
point, a curve is always falling, then one should not expect it to be 
higher after a finite distance, continuity being assumed. 

The above proof rested on the assumption that the derivative 
be unambiguously negative everywhere. This is often an ad- 
missible hypothesis as we have seen. But what about the cases 
in which the derivative is known to be unambiguously negative 
only at a point? Thus, 

gV) <0. a < a° < b (93) 

Can anything be said then about finite changes? Our answer is 
in the affirmative. It can be shown that for all finite changes in a 
smaller than some assignable quantity, there exist corresponding finite 
changes in x of the opposite sense. 

For by hypothesis g'{a) is continuous on the given interval, 
and of course at a 0 . Hence, from the elementary definition of 
continuity there exists a neighborhood around a 0 in which g'(a) 
is always negative, i.e., 

g'( a) <0. j a — a 0 1 < h (94) 

Thus, from our previous theorem 

Ax A a — (x — x°) (a — a 0 ) < 0. | a — a 0 j < h (95) 
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The implication of the last theorem is important. In order 
that the results of all finite movements be unambiguous, it is 
necessary that the sign of the instantaneous derivative be definite 
everywhere. For let the derivative change sign, and it will be 
possible to find contradictory finite movements. 


Thus far we have assumed that our equilibrium equations are 
such as to enable us to solve for our unknowns uniquely in terms 
of our parameters. Under what conditions will this be possible? 
It will be readily seen that even where our equilibrium conditions 
are defined as the result of a maximum problem there remains the 
possibility of multiple relative maximum positions. Under what 
conditions a*e we able to get a unique explicit solution of our 
implicit equations? What shall be done if multiple solutions are 
possible? 

Assume the implicit equations 

f = f Xi (x 1 , • • • , x n , a h • • • , ctm) =0, (* = 1 , ■ ■ • , n) (96) 


and a set of values (x/ 1 , • • • , x n °, ai°, • • • , a m °) satisfying these equa- 
tions. It is known from the Implicit Function Theorem 14 that 
there exists one and only one explicit solution of the form 

Xi = g%a i, • • •, a m ), (i = 1, • • •, n) (97) 


in a region around (xi°, • • • , x n \ cn°, • • • , «,„") where the following- 
functional determinant does not vanish, 


H = \U,.\ = 


f x l x l fx!X 2 
2*^1 


/w f x n x i 


f x l x n 

fx 2 X „ 


k 


^ o 


(98) 


Since we shall be considering chiefly the selection of regular 
maximum positions, we may assume 

(- 1 ) n H° = (- l) n \f X{Xj °\ > 0 (99) 

at the point of equilibrium. Providing this expression remains 
positive everywhere, we may be sure of a unique equilibrium. Of 
course, this is sufficient, but not necessary. 

14 See any Advanced Calculus. 
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Suppose, however, that the Hessian does change sign a finite 
number of times within the region of economically admissible 
values. Then the functions in (97) will be multiple-valued with a 
finite number of branches. Some of these may be ruled out im- 
mediately as not constituting maximum positions, namely those 
for which 

(— l) n H < 0. (100) 

It is possible to choose from the remaining branches only by re- 
ferring back to our original maximum problem. Let (xd, • • • , x n *) , 
(xi 2 , • • • , x n r ) , • • • correspond to a preassigned set (on, • • • , a m ) as 
multiple solutions of (96). That set (or sets) is retained for which 
/ is greatest. This will ordinarily serve to define our x’s as single- 
valued functions of the a’s except at a finite number of points. 
At these points it is a matter of indifference which of the possible 
alternative solutions is embraced. 

Broadly speaking this possibility of multiple equilibrium offers 
no serious difficulty. For all qualitative results remain. To illus- 
trate this let us consider the problem in full generality. Let 

Z = f(xi, ■■ - ,Xn,a 1 , •■•,«„) = 6{x 1 , • • •> X H ) 4~ B\x i, «i) 

+ B 2 (x 2 , ai) +•••+- B n (x n , a n ). (101) 

Consider a preassigned set of values of our parameters (ad, • • * , a n °) , 
and a corresponding optimal set of values for our unknowns 
(xi°, • • • , %n ) , not assumed to be necessarily unique. Then by 
definition of a maximum 

fix i, • • • , x n , ad, • • • , a n °) ^ f(x i°, • • • , x n °, ad, • - • , a„°) (102) 

where (xi, • • • , x n ) take on any values. For brevity this may be 
written 

f(X, cfi) - f(X°, ofi) -3 0. (103) 

Consider any other preassigned values of our parameters (ad, 
• • • , an 1 ), and a corresponding optimal set (xd, • • •, Xn 1 ). Then 

f(X\ a 1 ) - /(Z°, a 1 ) S 0, (104) 

f(X°, «°) - a°) iS 0. 

Adding both sides, we get 

[/(X\ a 1 ) - fiX*, a 1 )] - [/(X\ a 0 ) - }{X\ a»)] is 0. (105) 
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From the definition of a definite integral this can be written 

n pH 1 

E I * i CC \^ 9 ’ > Ctrl ) 

— /*X x i> • • •» *»> ai°, • • an°)]dx.£ S 0. (106) 

From the same definition this can be further transformed 

E E f f fxiaXx 1 , • • -,x n , ai, • • - , a n )dXidoij S 0. (107) 

1 1 d ctj 0 

But from (101) 


/*,.«, = B x . a i = 0, (-7 ^ i) 

~ Bxial, 


(108) 


so 


n ri i 

^ I | fx^ix 1, * , &n, OU, * , GijX)dXidoLi ~d 0. (109) 

1 %J *J ap 

By the theorem of the mean this may be written 


E fx ia AXiAcxi S 0, 

1 


( 110 ) 


where / Xja< is evaluated at an intermediate point. If we consider 
movements of the kth. parameter alone, others being held constant, 
this becomes 

} XkCth Ax k A<Xk >0. (k = 1, • • •, n) (111) 

Hence, if our criterion is definite in sign everywhere, e.g., 

/m = < 0 . (k = 1 ,-•-,«) ( 112 ) 

then, 

AxfcA a.k SO. (fe = 1, •••,») (113) 

This is proof that multiplicity of equilibrium values does not 
alter the definiteness of our conclusions with respect to finite 
changes. It also shows that the criterion which we applied in 
earlier sections to determine the definiteness of instantaneous rates 
of change can be generalized to determine the definiteness of finite 
changes. In fact, by a proper limiting process our previous 
theorems relating to instantaneous rates of change can be deduced 
as special cases from this more general analysis. 

The equality sign can be dropped if Aa 9 ^ 0, and if the maxi- 
mum is proper. It will be noted that in this proof we did not 
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require that 6 be continuous, but merely that the B i be continuous 
with the derivatives of the required order. This will turn out to 
be of considerable importance in the later consideration of discon- 
tinuities in the production function of a firm. 

A very important case is that in which z takes the form 


Then 


or 


& 6(x ij * j “£71) 23 Ot-iXi* 

1 

(114) 

^(.T 1 ) — X S 9(x°) — 

L OCi'-Xi 0 , 

(US) 

1 

1 


n 

n ** 


0(*°) - X ^ 0(x l ) - 

22 a^Xi 1 . 

(116) 

1 

1 


fleeting terms, we get 



X (on 1 - cii°)(Xi l - x?) 
1 

= 0, 

(117) 

X AotiAxi S 0 . 


(118) 


As illustration of the direct application of our methods to finite 
changes, let us revert back to the firm with a given total cost func- 
tion producing in a purely competitive market. For a preassigned 
price, p°, the firm will supply a given output, x° (not necessarily 
uniquely defined, since the firm may be indifferent in its selection 
among two or more outputs). Since profit is at a maximum for 
this output 

ZPW - C^ 1 )] - [>°x° - a* 0 )] s 0, (119) 

where x 1 may be any output, in particular that output appropriate 
to a second price, p 1 , such that 

lp'x° - C(x 0 )] - [jAd - a* 1 )] 0. (120) 


Adding we get 


and 


ApAx = ( p 1 — p^ix 1 — x°) Si 0, 

A pAx < 0 for Ax 5 ^ 0, A p 5 * 0. 


( 121 ) 

( 122 ) 
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For this analysis it is not necessary that the cost curve be con- 
tinuous, or be such as to yield a unique optimum output. The 
marginal cost curve may be undefined at points, have kinks, and 
turn up and down many times. 


Analytic Functions 

In cases where z is an analytic function of the .Vs and the a’s, 
finite changes can be evaluated by an infinite power series in the 
Act’s, whose coefficients depend upon partial derivatives of all 
orders of z at the original position of equilibrium. That is, 


Ax 


(i =!,■■■, n) (123) 

The general coefficient is of the form 


/ d n Xj \° 

• • • da,r n ) ’ 


where 


Z m. = n. (124) 

These coefficients can be computed from the equilibrium equations 
by differentiating as many times as is necessary with respect to 
the a’s. If the Hessian is not zero, this will yield recursion rela- 
tions sufficient to get the desired higher derivatives in terms of the 
already computed lower derivatives and the higher partial deriva- 
tives with respect to the x’s. 

Convertibility into a Maximum Problem 

Earlier in this chapter it was indicated that some problems 
which do not appear to involve extremum positions can at times 
be converted into an equivalent maximum or minimum problem. 
The advantage to be derived from so doing is merely notational, 
since it will require fully as much knowledge to ascertain whether 
the conditions of a maximum position are met as would be neces- 
sary to answer any questions which might be asked. Furthermore, 
there is the danger that unwarranted teleological and normative 
welfare significance will be attributed to a position of equilibrium 
so defined. To avoid misunderstanding it is well to emphasize that 
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the conversion of a problem whose economic context does not 
suggest any human, purposive, maximizing behavior into a maxi- 
mum problem is to be regarded as merely a technical device for the 
purpose of quickly developing the properties of that equilibrium 
position. 

Our problem may be stated in the following way. Given initial 
conditions of equilibrium 

f(x u *, Xnj OLm) ” 0 , if ~ 1 , * * *, ?z) ( 125 ) 

such that our unknowns are determined in function of the given 
parameters, namely, 

%i = g*(ai, • • •, a m ), (i = 1, ■■■, n) (126) 

under what conditions can the set (125) be regarded as the solution 
of an extremum problem so that the n indicated equilibrium loci 
correspond to the vanishing of the partial derivatives of some func- 
tion? That is, does there exist a function 

2 = fipc i, • • •, x n , «i, • • •, a m ) (127) 

such that 

fxfx l, ' , X n , OL\, * , Ot-in) “0, (f — 1 , ■ * • , 7l) (128) 

represents the same locus for each value of i as 

fix i, • • • , x n , di, • • • , «m) = 0? (i — 1, • • • , n) (129) 

Now the same implicit function may be represented in an in- 
finity of ways without changing the locus it denotes. 15 It is de- 
sirable, therefore, that we represent the loci which are our condi- 
tions of equilibrium in a definite unambiguous form. One such 
way is provided by solving explicitly for each variable in turn, 

Xi — J\£*(x i, * , Xi — i, ‘ * , Xn ) . if 1, , vt) (130) 

What conditions are necessary on these functions or on the func- 
tions f so that there exists a z as defined by equations (127) 
and (128)? 

Define 


f dxj \° _ | 

fdM i v (f/y 

\ dxj ) i 

i d Xj ) ~ Uv 


15 For example, (/*) 2 = 0, (i — 1, * * * } n) or F(f l ) — 0, where F(0) = 0; F(a) 9* 0, 
a 0 . 
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The X’s are unambiguously determinable independently of the 
representation of the /’ s. In general, 

If there exists a function z, the vanishing of whose partial deriva- 
tives is equivalent to the equations (125), then 


"x — Si . 

A ij £ 

J a 

Also for every possible triplet 


X , jX jlNk i 


lilMM. - SIMM \ \ ■ 

MMfu 0 u r 


(132) 


(133) 


because fa = fa for all possible pairs. 

These conditions are necessary. They are not identities for 
all values of (xi, • • • , x„), but only for those satisfying 


f(xi, •••,»„)= 0 ; (i = 1 , • ■ •, n ) 

i.e., they hold only at the point (xi°, • • •, x n 0 ). 16 They are not to 
be confused with the integrability conditions of chapter v. Though 
necessary, they are probably not sufficient. Quite possibly similar 
point reciprocity relations can be found between various derivatives 
of higher order. If all of these should hold, perhaps a complete set 
of sufficient conditions would be attained. 

The relationships given in equation (133) are vacuous if the num- 
ber of variables is less than three. All together n(n — 1 ){n — 2)/6 
independent conditions are involved. 17 

16 In addition, for all values of {x) satisfying a subset of r(< n) equations f* = 0, 
tliere will be defined similar relations which are identities in the remaining (n — r) 
variables. 

17 Any completely non-specialized set of n equations in n variables, /*(&i, * * ■ , x„) = 0, 
can be regarded as equivalent to a stationary position of a function of 2 n variables. 
Let 

n 

jF(%i f * * , Xnl Xn+n) “ f*(X i, * * * , Xn'jXn+i* 

l 

dF = 0 implies among other things that f*(x i, **♦,&») = 0. This is definitely not an 
extremum position as shown by reference to the secondary conditions. The fact that 
in a larger set of variables a stationary value corresponds to a non-specialized system 
seems devoid of economic significance. See G. D. Birkhoff, Dynamical Systems (New 
York: 1927), pp. 33-34. 
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As we have seen, it is not enough that our equilibrium equations 
be expressible as the partial derivatives of some function. In order 
to be able to derive definite theorems it is desirable that this func- 
tion be at a regular maximum or minimum. This requires as is 
shown in Mathematical Appendix A, Section II, that certain 
quadratic forms be either negative or positive definite. Thus, for 
a maximum 

H H fuhihj < 0, not all hi — 0. (134) 

i i 

As shown in Appendix A, Section III, this requires that the follow- 
ing inequalities hold. 

x , fn f 12 fn\ 

I/nl < 0; >0; ft i fit fn <0; etc. (135) 

J 21 J 22 r r r 

J 31 J 32 J 33 

For a regular minimum these determinants are all positive. It can 
be easily shown that either of these is equivalent to the following 
conditions. 

a 1 ^ ij 'h-ik 

. f > 0; 1 \jk > 0; etc. (136) 

for i, j, and k all unequal. This proves that it is not important 
that our problem be either a maximum or minimum one, but only 
that it be one or the other. 

As an example of a problem which can artificially be converted 
into an equivalent maximum problem, consider a number of inde- 
pendent firms buying the same kinds of productive services in 
purely competitive markets. 18 The demand of any firm for the 
factors of production can be written 

Vi = r { (p lt • • • , pn), (i = 1, •• •,») (137) 

where (»i, •••,»„) represent respective amounts of n factors of pro- 
duction, and (pi, • • • , p n ) their respective prices. It can be shown 

18 H. Hotelling, “Edgeworth’s Taxation Paradox and the Nature of Demand and 
Supply Functions,” Journal of Political Economy , XL (1932), 577-616. L. Court, “In- 
variable Classical Stability of Entrepreneurial Demand and Supply Functions,” Quarterly 
Journal of Economics , LVI (1941), 134-144. 
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dVj _ dVj_ 
dpj ~ dpi 


( 138 ) 


Summing over al! the individual demand functions, we have the 
following n general demand functions. 


n 

V - = R-KPu ■ ■ ■> Pn), (* = 1 , •••,») ( 139 ) 

i 


where the capital letters stand for the total amounts demanded. 
From (138) 


dRi dlV 

dpj ~~ dpi 


( 140 ) 


There exists, therefore, a function 


Z — R(pl, • • • , pn) ~ (Vipi + V 2pl Vn pn), 

where 


dR 

dpi 


= R%Pl, • • - ,Pn). 


O' = 1, • • •, n) 


( 141 ) 

( 142 ) 


The conditions of equilibrium represented by the general demand 
functions of (139) are, therefore, equivalent to those derived from 
the condition that Z be at a maximum with respect to the p’s, i.e., 


dZ_ 

dpi 


f?‘ - F,- s 0. 0 = 1, • • ■ , n) 


( 143 ) 



CHAPTER IV 


A COMPREHENSIVE RESTATEMENT OF THE 
THEORY OF COST AND PRODUCTION 

Economic theory as taught in the textbooks has often tended to 
become segmentalized into loosely integrated compartments, such 
as production, value, and distribution. There are, no doubt, peda- 
gogical advantages in such a treatment, and yet something of the 
essential unity and interdependence of economic forces is lost in so 
doing. A case in point is the conventional assuming of a cost 
curve for each firm and the working out of its optimum output with 
respect to its demand conditions. Only later is the problem of the 
purchase of factors of production by the firm investigated, and 
often its connection with the previous process is not clearly 
brought out. 

I should like here to investigate from the point of view of the 
previous chapters the cost curves of the firm as usually presented 
and the production function embodying technical relations between 
inputs and output which lie behind it, and also to show clearly its 
relevance to the problem of the determination of the optimum 
output. In particular, I shall attempt to derive all possible opera- 
tionally meaningful theorems. Much of what is said will be found 
to apply regardless of the elasticity of the demand curve for the 
firm; i.e., under “impure” as well as “pure” competition. By em- 
ploying a suitable notation it is possible from a purely technical 
point of view to consider the case of any number of productive 
factors as easily as one or two. 

Statement of Problems 

In the beginning the revenue side of the firm is completely 
ignored. We assume as given by technical considerations the 
maximum amount of output, x, which can be produced from any 
given set of inputs (v u This catalogue of possibilities is 

the production function and may be written 
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In general, there will be a maximum output for each set of inputs, 
and so this function is single-valued, and will be assumed initially 
to have continuous partial derivatives of desired order. Further- 
more, no marginal physical productivities can be negative, else 
output would not be maximal since it could be improved for the 
same set of factors by leaving some idle. Regardless of this con- 
sideration, in the relevant region under consideration we will have 

<Pi SO, O' = 1, • • •, n) (2) 

where as a convention of notation 

dx „ 

<pi — j- — marginal physical (degree of) productivity. Simi- 

00 i 

larly, we define 

dfx 

~ dVidvj 

It is also assumed that each firm is small relative to the market 
for each input so that unlimited amounts of each can be bought 
at the respective prices (w x , • • ■ , w n ) . 

As a matter of definition the total cost of the firm may be 
written as the sum of the costs for each input item and any other 
costs which are independent of the purchase of the designated 
inputs and output; i.e., 

C - A + £ wan, (3) 

where A represents costs which do not vary with the designated 
inputs and output (taxes, etc.). Of course, these fixed charges 
may be zero. 

A brief survey of our field by economic intuition will aid in 
phrasing the problems to be investigated, after which mathematical 
analysis may be employed in stating the conditions imposed on our 
various functions. Our aim is to derive the total cost for each 
output. More precisely, with given prices of productive factors 
and given production function, we are interested in deriving the 
minimum total cost for each output. This will be a function as 
follows: 

C = A + V(x, w h • • • , w n ). (4) 

If we regard the prices of the productive factors as constant, the 
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resulting relationship between C and x is the usual total cost curve, 
from which average and marginal curves can be derived. 

The question arises as to the relationship between (3) and (4) . 
It is clear that the same output can be produced by an infinity of 
combinations of productive factors so that in the absence of other 
considerations it is impossible to determine the total cost uniquely 
for each output. In view of the consideration that total cost for 
each output be a minimum, our indeterminacy disappears. Out 
of the set of all possible input combinations which will produce a 
given output, that combination is selected which minimizes the 
total cost defined in (3). In other words, with given prices of pro- 
ductive factors and for a preassigned output, there is an optimum 
value for each productive factor; i.e., 

Vi = /*’(*, Wi, ■ ■ •, Wn). (i - 1 (5) 

By substitution into (3), we have 

n 

C = A + £ Wif(x, wi , ■ ■ ■ ,w n ) = A + V(x, wi, ■ • ■ , w„). (6) 

i 

Thus, the relationship between (3) and (4) is revealed. 

It is part of the purpose of this book to investigate the proper- 
ties of the functions (4) and (5). It is true that theoretical eco- 
nomics does not deal with particular forms of functions (e.g., 
polynomials, etc.). However, it does concern itself with the 
general character of various functions ; that is, their slopes, curva- 
ture, etc. In this case we are interested in the following properties 
of these functions: 

dC P£ dC_ PC n) 

dx ’ dx z ’ dWj ’ dxdwj ’ ^ 

i.e., how are total and marginal costs affected by changes in output 
or prices of productive services? On what properties of the pro- 
duction function does this depend? 

We are also interested in 

dVj dVj dVj _ ' 

dWi ’ dWj ' dx ’ , 

i.e., what is the reaction of demand for a productive service with 
a change in its own price? With a change in another price? With 
a change in output? 
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These are obviously important theoretical questions, and yet, 
curiously enough, the answers to some of them do not appear to be 
in the literature. In the next section these will be mathematically 
evaluated, and the results summarized. It will be found that 
under very general assumptions the knowledge of the signs of these 
differential quotients will give us the direction of change with 
respect not only to sufficiently small finite movements, but also to 
finite movements of any size. 

Conditions of Equilibrium 

Thus far we have employed only the notation of mathematics. 
By so doing, the problem has been clearly framed, which in many 
economic problems is more than half the battle. It remains now 
to state the derivations of (4) and (5) from (1) and (3). 

Our problem is to minimize 

C - A + Y. WiVi (9) 

i 

subject to 

<p{v i, •••,!>„) = x = constant. (10) 

Mathematically, this is a constrained minimum problem, and we 
may avail ourselves of the method of the Lagrangean (undeter- 
mined) multiplier. We define a new function 

G - A + £) WiVi - XO(»i, •••,»„) - x], (11) 

1 

where (— X) is a Lagrangean multiplier, whose economic interpre- 
tation will be brought out later. G may be regarded as a function 
of all the inputs as independent variables. It is necessary for a 
proper relative minimum that 

dG 

— = 0 = Wi — \<pi. (i = 1 , •••,«) ( 12 ) 

These may be rewritten as 

This is the well-known economic theorem that in order for total costs 
to he a minimum for any given output, the marginal productivity of 
the last dollar ( 1 /X) must he equal in every use. Alternatively, it may 


< £n 

wf 


(13) 



THEORY OF COST AND PRODUCTION 


6l 


be stated that the marginal physical productivity of any factor 
must be proportional to the price at which it can be hired, the 
factor of proportionality being the term X. 1 It will be -noted that 
this condition is independent of the revenue curve of the firm and must 
hold at every point on the cost curve, not only at the final point of 
optimal output. 

Provided that certain secondary conditions to be discussed 
presently hold, the n equations in (12) and the one equation in 
(16) are sufficient to determine each of our (« •+ 1) unknowns 
(vi, • • • , v n , X) in terms of x and the factor prices (wi, • • • , w n ) re- 
garded as parameters. Thus, our equations (5) are defined im- 
plicitly by these minimum conditions. 

One may well question the advantage of this formulation. Ap- 
parently we have substituted an indirect statement of our final 
conditions for a direct one. But, and this is typical of all sound 
economic theory, we are trying to deduce the consequences of our 
hypothesized data, and we do know (by hypothesis) much about 
the functions in (12). Merely to state the relations in (4) and (5) 
is formal and empty. By means of (12) we can place positive 
restrictions upon them and know their general properties. 

Now there remains the purely mathematical problem of trans- 
lating our assumptions into terms of the functions (4) and (5) , and 
computing their respective partial derivatives designated in (7) 
and (8). 

Secondary Extremum Conditions 

To do so we must first state all that we know about the condi- 
tions defined in (12), in particular the secondary necessary and 
sufficient conditions for a proper relative constrained minimum. 
It is clear that in order for total cost to be a minimum for a pre- 
assigned x, the locus of all possible inputs yielding that preassigned 
quantity (isoquant surface) must be tangential to a locus of all 
possible inputs yielding the same total cost (isocost plane). But 
of course this is not enough. The isoquants must also be convex 
to the origin in all directions in order that its contact with the 
isocost plane represent a true proper minimum. The analogy with 
the theory of consumer’s preference suggests itself. This is brought 
out more clearly if we phrase the problem not as that of minimizing 

1 Later X will be shown to be equal to marginal cost, i.e., dC/dx. 
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total cost for a preassigned output, but rather in the equivalent 
form of maximizing output for any preassigned total expenditure. 
Mathematically, our secondary conditions are 


£ £ <Pi£&> < 0, 
1 1 


(14) 


for 


and not all 


£ 

i 


o, 


fc = o. 

Consider the bordered determinant 


D 


<Pii\ 

<Pi 


<Pi 

0 


<Pu 

£12 

<Pin 

£ i 

£21 

£22 

(pin 

£2 

£n 1 

£n 2 

<Pnn 

(pn 

£i 

ip 2 * ’ 

(Pn 

0 


(IS) 


and respective principal minors 


2)12 


£ n 
£>21 
£l 


(pn 

ip 22 
£>2 


£ 1 
<P% 
0 


2)123 


£ll 

£21 

£31 

£l 


£12 

£22 

£32 

£2 


£13 

£23 

£33 

£3 


£i 

£2 

£3 

0 


etc. (16) 


It is well known 2 that (14) implies that any such minor of the 
order m must have the sign of (— l)” -1 , and conversely: i.e., 


Specifically, 


(- l) m ~ 1 Z> 12 ... (m " 1) >0. (m S n 4- 1) 


(17) 


~ 2 <pij<pi<Pj 4* <Pjj<Pi 2 < 0; etc. (i ^ j) 


It will be noted that this condition does not necessarily imply or 
require the law of diminishing marginal physical productivity 
to hold. 3 

2 Compare H. Hotelling, “Demand Functions with Limited Budgets,” Econometrica , 
January, 1935, pp. 66-78. 

3 1 am assuming that these secondary conditions hold not only at the minimum 
point, but everywhere. Mathematically, this assures us of the uniqueness of our equi- 
librium, since this stronger assumption definitely rules out multiple relative minima* 



THEORY OF COST AND PRODUCTION 


The secondary conditions are not always mentioned in the 
literature. It is not for elegance or completeness that they are 
included here, but rather because they are wholly relevant to the 
problem under discussion, since it is upon them that all our results 
depend. 

Displacement of Equilibrium 

It is now possible to derive in compact form the rates of change 
of our dependent variables (vi, •••,»„) with respect to changes in 
the variables (x, W\, ■ • ■ , w n ). The reader not interested in the 
mathematical derivation of these conditions is referred to a sum- 
mary of results at the end of this section. First we write the total 
differential of our equilibrium equations (12) and (10)1 


it < pijdvj + ■— dX 


dw-i 

X 


(i = !,•••, n) 


(18) 


X! ipjdvj = dx. 

i 

These are ( n + 1) linear equations in in -f- 1) unknowns 
(dv i, • • • , dv n , dX), and may be solved in determinantal notation as 
follows : 


dv k 


Na + dxA n+1 , k 

i A 


where 


A = 


i ‘Pij 1 
Pi 


<£i 

X 

0 



A 


' r 


<pll 

<Pl 2 


< Pin 

X 

<P2l 

(p 22 


<P2n 

ip 2 

X 

<Pnl 

<Pnl 

. . . 

<i Pun 

<Pn 

X 

<Pl 

<P2 


<Pn 

0 


(19) 


( 20 ) 


and A qr is the cofactor of the element in the rth column and the 
gth row. 
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Also, 


n 

L 

—r~ A^n-f-l ~f” i,n+l 

K 

(21) 

d\ — — 

A 

Hence, 

dVk Ajfc 

dWj XA 

(22) 

As a special case 

Cl>) Qj 

lb 

II 

>\g 

(23) 

Also, 

dVk An+1,* 

dx A 

(24) 

and 

(9 X Afc,n-f- 1 

dm XA ’ 

(25) 


dX An+l,n+l 

dx ~ A 

(26) 

It is clear on inspection 

of the determinant A, that 



t> 

II 

(27) 

Also 



1 _ 

A jlc ^ ID jk 

11 

<1 

11 

(28) 


— ■D/.n+l XA rt -j-l,;, 0 ^ ’ 


and 

Therefore, from (28) 


n) 


(29) 

(30) 


( 31 ) 


dVk_ _ dVj 

dWj dWk 

That is to say, the change in the kth factor with respect to a change 
in the jth price, output being constant, must be equal to the change in 
the jth factor with respect to the kth price, output being constant; a 
result which is not intuitively obvious. 

From (27) and (28) 

Ajk 

A “ D ' 


( 32 ) 
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But from our stability conditions in (17) 

< 0. (j = 1, •••,«) (33) 

Therefore, 

f|<0. 0 = 1, •••,») (34) 

That Is, fixed output will always be produced with less oj any 
given factor as its respective price rises f other prices not changing . 
By the law of the mean this can be shown to hold for finite changes . 

Let us now determine the economic meaning of X. Rewrit- 
ing (11) 

n _ 

G = A + Z — x], 

1 

and differentiating G, which is total cost with a term added on, 
we get 


This suggests that X may be marginal cost. This can be proved 


rigorously in two ways. Of course, 


n 

dC - Z Widvi, 

1 

(35) 

and 


n 

dx = Z Pid-V;. 

1 

(36) 

Dividing (35) by (36), we get 


» 

Z iMidVi 

dx Z v4v t 

1 

Substituting from (12), 

n 

. ~ Z X< Pidvi 

— - * — X(», w.x, •••,%). 

Z vdvi 

(37) 

(33) 


1 


foundations of economic analysis 


More rigorously the proof is as follows: 


ac_ A W 'dVi. 
dx 1 1 dx 

(39) 

Substituting from (24), 


dC A An+l.i x £ 

S' = 4 4 

(40) 

But expanding A in terms of the elements of tne last row, 


n 

A = Tl <r , An+l,2* 

1 

(41) 

Hence ’ ‘ ac_ a_, 

dx 4 

(42) 

Therefore, we may rewrite (12) as follows: 


ac 

Wi = aT ** 

(43) 

Thus, it may be stated as a theorem that in order for total costs to 
be a minimum for any given output, the price of each factor must be 
equal to marginal physical productivity times marginal cost A This 
holds regardless oj revenue considerations . 

Of course, 

ax _ dfC 

dx dx 2 ’ 

(44) 

and 

ax a 2 C 

dw k ~ dxdWk 

(45) 

From (24), (25), and (29), 

a\ _ dvk 
duik dx ’ 

(46) 

or 

d 2 C _ dvk_ 
dxdWk dx 

(47) 


4 This has been pointed out in another connection in lectures by Professor Viner 
with ill uminating insight into the relationship of the external to the internal margin, and 
the broad zone of indifference as a substitute for the infinitesimal. Paradoxically, it is 
this condition which is basic to Mr. Wong’s famous envelope theorem! 
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That is, the change in any input item with respect to an increase 
in output must be equal to the change in marginal cost with respect 
to a change in the price of that input item. 

Recall from (26) 

dX _ d 2 C _ A n+Un+ i 

dx dx 2 A ' 


Now it is known from (17) and (27) that A has the sign of (— l) n . 
Also 




<p 11 


<Pn 1 


<Pln 


(fun 


H, 


(49) 


where H is called the Hessian determinant of the production func- 
tion. Obviously, therefore, the slope of the marginal cost curve must 
have the same or opposite sign as compared to this Hessian, depending 
upon whether the number of inputs is even or odd; i.e., 


(- 1 )*H 


dfC 

dx 2 


> 0 . 


(50) 


Thus the stability of pure competition is intimately tied up with 
the Hessian of the production function, a result not intuitively 
obvious. 

I should also like to indicate certain other results, leaving their 
rigorous derivation to the interested reader. Consider equation (5) , 

Vi = f(x, Wi, • • •, w n ). (i = 1 , •••,«) 


These functions are defined by (10) and (12), 


and 


X — (pip It * t ^n) 

Wi — \<pi = 0. (* = 1, 


(12) may be rewritten as follows: 


n) 


^ = — . (i = 2, •••,») (51) 

<pi Vh 

Obviously, the changing of all prices in the same proportion will 
leave the solution of (51) unchanged; hence, the functions in (5) 
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must be homogeneous of order zero in the variables (w lt • ■ •, w n ), 
x being constant; i.e., 

Vi = f(x, w u ■ ■ ■ , w n ) = f'(x, yw h • • • , yw„), (i = 1, •••,«) (52) 


where y is arbitrary. Hence, from Euler’s theorem on homogene- 
ous functions, 


0 = 



(«' = !,•••,«) 


(53) 


This can be verified by substitution from (22). 
By analogous reasoning, 




» dC 

C = A + 


where 


dC 


(54) 

(55) 


Actually, from the considerations of the previous chapter it is 
possible to deduce a more general condition which includes (34) as 
one minor part. The minimization of total expenditure for a given 
output, price being fixed, implies (10) and (12). In addition, at a 
regular minimum we must have 5 


» » ffo) . 


(56) 


for not all £,• proportional to w,-. Consider the determinant 


and 


d(Vh • • • , Vn) 
d(Wi, - ,W n ) 



(57) 


r _ dfon vj) 
12 d(w h wf 


3^1 

dvi 


dWi 

dv 2 

dv 2 

dwi 

dw 2 


r 123 


d(t>i, v%, vf 
d(Wi, w 2 , wi) 


etc. (58) 


Then each such principal minor of order m (less than n) must be 
negative or positive depending upon whether m is odd or even ; i.e., 

(- 1 > 0. C 12 ... n = G = 0. (59) 

5 Compare chap. iii, p. 22, Mathematical Appendix A, Sections IV and V, and chap. 
v y pp. 28-29. 



THEORY OF COST AND PRODUCTION 


69 


Specifically, 


d~Oj 


dVj dVk 


<0; (M>0; etc. (60) 

Kfuuj dWj dWk \ dwjc J K 

It may be well to summarize our results of this section : 


dvj 

dWj 


< 0 ; 


Wi = \<pi 


Y dVi 
1 dWj 


*> > 0 ; etc. 

d{Wj, W k ) 

(34) and (59) 

dv k _ C toy 
dWj ~ dwic ’ 

(31) 

d 2 C dVj 

dxdWj dx ’ 

(47) 

dC r 1 x 

( 12 ) and (42) 


(50) 

»• = 0 , (* = 1 ,•••,«) 

(53) 

dC 

dWi V " 

(55) 

A . " dC 
= A + 22 Wi - 5 — 

1 dWi 

(54) 


Boundary or Corner Minima 

Even in the case where the production function and its deriva- 
tives are continuous with the proper convexity to insure a uniquely 
determined optimal position, the interesting case may arise where 
some factor may not be used at all. That is to say, the more that 
others are used and the less of it, the lower will costs be for any 
given output. In this case the conditions for equilibrium do not 
require the equalization of the marginal productivity of the last 
dollar spent on it to that of other factors. Rather do we have a 
boundary minimum due to the fact that no negative values are 
economically admissible. Hence, the conditions of equilibrium 
are given by the statement that for any input, potentially usable 
but not actually used, the marginal productivity of the last dollar 
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spent on it must not be greater than the marginal productivity of 
the last dollar spent on factors which are used . 6 
Mathematically, 


Vu ^ 1 

w u ~ X ! 


( 61 ) 


where the uth factor is one not actually used. 

As the price w u changes, it may still remain unused until some 
critical level at which it will begin to be used, and hence will fall 
into the analysis of the previous section. Of course, the critical 
level may very well depend on the scale of operations; i.e., output, 
so that with the same price it may still be brought into use by an 
increase in output . 7 

The demand function for such a factor of production will have 
the following properties : 

Vu = f u (x, w u , W U ■ ■ -,w n ), (62) 

j tj 

-r-~— s= 0 in some domain defined by ^(x, w„, w u • • • , w n ) < 0, (63) 

aw u 

dv 

< 0 in some domain defined by ^(x, w u , Wi, ■ • ■ , w n ) > 0, (64) 

ow u 

where C is so constructed as to form the locus of all critical points 
described above. 


Discontinuities in the Production Function 

I should now like to drop the assumption that the production 
function is necessarily continuous with continuous partial deriva- 
tives at every point. This assumption has been challenged by 
many economists, who have alleged that production coefficients 

6 There is an interesting discussion of an exactly analogous equilibrium system in 
the famous paper of J. Willard Gibbs, “The Equilibrium of Heterogeneous Substances,” 
Collected Papers , I, 55-349. 

7 An analogy is suggested to the case of items which do not enter into a consumer's 
budget until income increases or their relative prices decrease to some critical levels. 
Be it noted that this phenomenon here described can occur even though there be in- 
creasing marginal physical productivity, just as the budget case does not restrict the 
behavior of marginal utility. It can be mathematically proved that this result is inde- 
pendent of the cardinal measure of product (utility). Another analogy is provided by 
the classical comparative cost doctrine — according to which a country specializes com- 
pletely in one good; the equilibrium is defined by a certain inequality between prices 
and marginal costs. 
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are technically fixed, that some factors are “limitational,” some 
factors of production “must” be used in certain joint proportions, 
etc. These discontinuities, if true of the real world, have been 
thought by many economists to offer serious problems to the analy- 
sis of distribution and the pricing of the factors of production. 

It will be argued here that the fact of discontinuity offers no 
problems to the firm — on the contrary, its task is made much 
easier. As an obiter dictum I hold that it also offers no particular 
difficulty to the analysis of the wider problem of determining the 
prices of the factors of production with which each firm is to be 
confronted. As before, these are to be determined by general 
equilibrium analysis of supply and demand. 8 

As before, we have a production function relating maximum 
output to any given set of inputs: 

x = <p(v i, •••,»„)• (65) 

Precisely as argued in the first section this must be single- valued. 
However, it need not be continuous nor have partial derivatives 
at every point. In order that there be no contradiction to our 
definition of output as maximal, we must have 

A<p S 0, for A Vi £ 0. (i = 1, • ■ ■ , n) (66) 

That is, as we increase all factors together, output cannot decrease, 
since otherwise product would not in the next position be maximal. 

For definiteness I assume that along an isoquant the production 
function contains only a finite number of points which do not 
possess continuous partial derivatives. At a point of discontinuity 
it is assumed that left-handed and right-handed partial derivatives 
exist. Of course, at a point of discontinuity there does not exist 
a uniquely defined plane tangent to the isoquant, but limiting 
direction cosines can be found for all planes which touch but do 
not cut the isoquant surface. It is also assumed that the isoquants 
are of a “single concavity,” to be defined later. The production 
function so defined is general enough to include the case of fixed 
coefficients of production, “perfectly complementary” factors, 
limitational factors, etc. 

8 It is possible that within a narrow range the price may be indeterminate due in 
special cases to coincidental inelasticity of supply and demand. 
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Note that all the inputs are to be regarded as independent 
variables. It is never true that they must be used in fixed propor- 
tions. It is true that it may be unprofitable not to do so, but this 
is a result of economic calculation. Even in the continuous case, 
given certain economic data, the factors must (on consideration of 
profitability) be used finally in given proportions, and in deter- 
mined amounts for each output. The only difference between 
these cases is that in the discontinuous case the required optimum 
point may be more obvious and less sensitive to changes in the 
prices of all factors of production. Be it understood that I do not 
minimize discontinuity. On the contrary, I should like to indicate 
the outline of a method which will handle both cases. 

Along an isoquant it is assumed that we have convexity as de- 
fined below. Consider any point on an isoquant, (pi 1 , • ■ v n l ). 
There must exist constants, («/, • • • , an 1 ), (not necessarily unique) 
such that 

£ - Vi 1 ) is 0, (67) 

i 

where (»i 2 , • ■ • , vj) is any other point along the same isoquant. 
This merely says that there exist one or more tangent planes at 
each point which touch but never cross the isoquant. Similarly, 
at the second point there exist constants, (ar, • • • , aj), such that 

£ - v*) a 0, (68) 


- E <XiW - v 1 ) 2= 0. (69) 

i 

Adding (69) and (67) and changing sign, we get 

£ (a, 2 - - v 1 ) £ 0. (70) 

Since these are two arbitrary points, taking one as fixed, say 
W • • • , Vn°), we must have along an isoquant 
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and 


£ AaiAvi Si 0. 


( 72 ) 


Actually, it can be verified that the necessary and sufficient 
conditions imposed on (ai°, • • •, a„°) in order that (71) holds are 
the following inequalities: 


<P 


.s 


ai 


<Pl L Oil 0 


jP_ 
<P i 


.L 


(73) 


where <pi L and <pi s are respectively the largest and smallest of the 
left- or right-hand derivatives at the given point. At any point 
where there is no discontinuity, the right-hand and the left-hand 
derivatives are identical; hence, the inequality converges to the 
equality, 


ai° (My i°, • • • , v „°) 


(74) 


Oil ° <Pl(Vl°, ■ • • . »n°) 

All of the above are merely elaborations of the definition of con- 
cavity. It remains to show its relevance to the problem at hand. 

Conditions of Equilibrium 

Suppose we are given a set of prices ( Wi ° , • • • , w n °) correspond- 
ing to which there is one combination of factors (t>i°, * • • , v n °) which 
minimizes total cost for given output. As a definition of our 
minimum 

AC g 0 (75) 

for 

Ax = 0, AVi $ 0. 

In other words, for any point (»i, •••,»«) along the same isoquant 
we must have 




£ 

i 


(76) 


for 


or 


<p(Vl, ' ‘ Vn) = <P(V 1°, ‘ • •, Vn°), 


£ wfAVi g 0 

1 


( 77 ) 
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for 

<p(v i + Avi, - • • , v n ° + Av„) - <p(v i°, • • • , v n °). 

Obviously from (73) this implies 

fl < *£ < (7&) 

<pi L Wi 0 <pi s ' 

In the case that the maximum point is one at which continuous 
derivatives exist, we have the conditions of the previous section, 


<Pi wf. 
(pi ~ wi u " 


(79) 


Thus, condition (78) is the general condition which includes (79) 
as a special-case. 

Furthermore, it is still possible to place definite restrictions on 
the demand functions for the factors of production, output being 
constant. As before, 


Vi° = f{x, Wi°, • • •, w n °), 
or 

Av i = fix, Wi a + Aw h • • • , W n ° -j- Aw n ) — Vi°. 
From (72) we have 


for 


E AWiAVi g 0 

i 

Ax = 0. 


(80) 

(81) 

(82) 


Suppose that only the price of the kth factor varies. Then 
(82) becomes 

AWkAvjc 0. (83) 

That is, an increase in the price of a factor cannot result in an 
increase in its use. Likewise, a decrease in its price cannot result 
in a decrease in its use. Still more generally, it can be stated that 
a change in the price of any number of factors cannot result in a 
change in amounts of all the factors in the same direction: i.e., 

E AwjAVj s 0, (r s n) (84) 

x 

for all Aw, s 0, not all AVj > 0 ; likewise, all Awj S 0 implies not 
all Av, < 0. 
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Determinateness of Equilibrium 

Clearly our minimum cost is unambiguously determined even 
in the case of discontinuity. The task of the firm is easier because 
the penalty for not being at the minimum point is greater and more 
obvious. To be at the top of a smooth hill requires fine balancing 
and judgment. To find a maximum which is a cusp or spire is 
much easier. Moreover, such an equilibrium is extremely stable. 
It is sometimes called “too stable” equilibrium. In order to move 
it a very large change in prices may be necessary. In the limiting 
case where the increase of each factor in more than the optimum 
proportion results in no increase in product, the wage may fluctuate 
from zero to infinity without changing the proportions of factors 
employed for each output. And yet the problem is completely 
determined. 

It is curious to see the logical confusion into which many 
economists have fallen. The primary end of economic analysis is 
to explain a position of minimum (or maximum) where it does not 
pay to make a finite movement in any direction. Now in the case 
that all functions are continuous, it is possible as a means towards 
this end to state certain equalities on differential coefficients which 
will (together with appropriate secondary conditions) insure that 
certain inequalities will hold for finite movements. It is no exagger- 
ation to say that infinitesimal analysis was developed with just 
such finite applications in view. Unfortunately, the means have 
become confused with the ends, and so conventions and artifices 
are continually sought in order to be able to make statements con- 
cerning marginal equivalences. A case in point is the Marshallian 
marginal net productivity. It is only in the singular case that the 
production function is differentiable (i.e., possesses certain proper- 
ties of continuity) in a certain direction (i.e., for certain composite 
movements) that it is possible to employ this device ; whereas the 
inequalities of condition (78) always give necessary and sufficient 
conditions, and include the marginal net productivity relations 
whenever these happen to be applicable. 9 

9 An example of another improper use of marginal net productivity curves is pro- 
vided by the treatment in Mrs. Robinson’s The Economics of Imperfect Competition 
(London: Macmillan, 1933), chap. xx. The reasoning is mathematically circular; the 
so-called demand curves shift with changes in the wage of one factor ! This follows from 
the fact that they are drawn up as of appropriate amounts of the other factors. These 
appropriate amounts are necessarily a function of the prices of all factors of production. 
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Although equilibrium is determinate for the single firm con- 
fronted with prices of the factors of production, discontinuities of 
the production function may introduce some difficulties into the 
general equilibrium problem whereby all firms and individuals 
together through the interplay of their demands and supplies de- 
termine the prices with which each is confronted. For discon- 
tinuities may introduce perfect inelasticities of demand in certain 
domains ; there emerges the possibility, however remote, of coinci- 
dental inelasticities leading to indeterminacy of price within certain 
limited ranges. But this is outside the scope of the present 
discussion. 

Maximization of Profit 

e 

By now we have attained to the point where most discussions 
begin. We have seen how to derive the locus of factor combina- 
tions which give lowest total costs for each output. But as yet 
the scale of operations, level of output, actually to be undertaken 
has not yet been determined. This can only be done in the face 
of a new set of considerations, those relating to the terms at which 
different amounts of the commodity can be sold. I take as a 
datum the maximum amount of gross total revenue which can be 
secured for each level of output. This may be written 

R = R(x). (85) 

Let us define profit, net revenue, as the difference between gross 
revenue and total expenditure, 

x = x(x, w i, ■ • w „ ) = R(x) — A — V(x, w h • • - , w n ). (86) 

Output will be optimum when profit is at a maximum. Necessary 
conditions that this be so when all functions are differentiable are 


dir _ dR 
dx ~ dx 
dhr _ d 2 R 
dx 2 dx 2 


dV 
dx 
dW 
dx 2 


0 , 


0 . 


(87) 

( 88 ) 


Assuming that we have a regular relative maximum, this becomes 


dR = dV 
dx ~ dx ’ 
8*R dW 
dx 3 dx 2 


(89) 


( 90 ) 
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This is the familiar theorem that at the optimal output the 
marginal revenue curve must intersect the marginal cost curve 
from above. 10 


Regardless of whether or not the profit function has 
at each point our proper maximum conditions are 

derivatives 

Att < 0, 

for 

Ax 5 0. 

(91) 

<||<J 

V 

C<\h 

<j|<a 

o 

for 

Ax > 0, 

(92) 

A R AC 
Ax Ax ’ 

for 

Ax < 0. 

(93) 

The economic common sense of this is 

obvious. 11 



Equation (89) gives us a relation to determine optimum output 
x°. By substituting the value of x°, so obtained, in (5) we get a 
new set of demand curves for factors of production, drawn up as 
of a given total revenue curve. 

Vi = f(x°, w u • • • , w n ) = g%w h ■ ■ ■ , w n ). (i = 1, • • • , n) (94) 

It is possible to derive the positions of output, input, etc., in a 
more direct fashion by treating all the inputs as independent 
variables. 

Let 

n 

ir(v 1 , wu ■ • - ,w n ) = R\_tp(v 1 , •••,»«)] — A - z V){0i. (95) 

i 

At a proper maximum 

<Pi — Wi = 0, (i = 1, • • • , n) (96) 

and 

jT ~ [) <pij “h R-zx O i 

must form the coefficients of a negative definite quadratic form. 

10 The famous cost controversy may be interpreted as an argument over the impli- 
cations of these conditions. 

11 In the case of pure competition, when price is independent of sales, these condi- 
tions become 

* < — and p > — , respectively. 

Ax Ax 

It can also be shown that ApAx ^ 0; i.e., that an increase in prices cannot, ceteris paribus , 
result in a decrease in quantity supplied. Hence, the firm s supply curve cannot be 
negatively inclined. 
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The results in (96) follow from (43) and (89), since for any 
output factor price must equal marginal cost times marginal phys- 
ical productivity, while for optimum output marginal cost equals 
marginal revenue. These are n equations from which we can 
solve for the n factors of production in terms of the n prices to get 
the demand functions of (94). 

Vi = g^wu ■ ■ ■ , w n ). (i = !,•••, n) 


Actually, it is known that 12 




(97) 


where Ta is' the cofactor of the element of the itb. row and the jth 
column of the above matrix. From the definiteness of the above 
quadratic form it follows that these last must form the coefficients 
of a negative definite form; i.e., 


dvj 

dWj 


< 0 ; 


fKgh g*) n . 

d(wj,w k ) ^ ’ 


etc. 


(98) 


Indeterminacy in Purest Competition? 

If competition is “pure” in the commodity and factor markets, 
and the production function is homogeneous of the first order, then 
it is a classical fact that the matrix T, which except for factors of 
proportionality is by the first assumption identical with the Hessian 
of the production function, is singular. Therefore, a regular maxi- 
mum for the firm is impossible. Unit costs being constant, and 
demand being horizontal, there are only three possibilities: price 
being everywhere greater than marginal cost, it will pay the firm 
to expand indefinitely, i.e., until competition ceases to be pure ; or 
if price is less than marginal cost, no output will be produced ; or, 
finally, if price is identically equal to marginal cost, the exact 
output of the firm will be a matter of indifference. Thus, what 
was usually regarded as the most favorable case for pure competi- 
tion turns out to yield indeterminate output for each firm. 

However, too much should not be made of this paradox. Even 
though the output of every firm may be indeterminate, their sum 

“ Compare H. Hotelling, “Edgeworth’s Taxation Paradox and the Nature of 
Demand and Supply Functions,” Journal of Political Economy, XL, 577-616. 
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may be determinate, in the same way that the sum of two discon- 
tinuous functions may be continuous. For if many firms expand 
output, others will contract; for price will fall along the industry’s 
demand curve causing contraction. It has been argued, however, 
that competition will disappear since any one firm, with no ob- 
stacles to its expansion, will grow until it is a “significant part of 
the market,” at which time it will be able to affect the price of 
its product. 

This is similar to the familiar argument by which it is shown 
that decreasing marginal cost within a firm will lead to monopoly. 
Nevertheless, the analogy is faulty. For the demand curve of any 
firm is equal to the demand curve of the industry minus the supply 
curve of the remaining firms, already in the industry or’ potentially 
therein. This being the case, it is easy to show that under uniform 
constant costs the demand curve for a firm is horizontal even 
though it produces 99.9 per cent of all that is sold, Geometrically, 
the long-run supply curve of potential rivals is horizontal, and a 
horizontal curve subtracted laterally from any curve must always 
yield a horizontal curve. Economically if the firm were to begin 
to restrict output so as to gain monopoly profit, it would cease to 
sell 99.9 per cent of the output or even anything at all. Conse- 
quently, it would not attempt to do so, but would find its maximum 
advantage in behaving like a pure competitor. 

Thus, it remains true that the classical assumptions underlying 
pure competition are actually consistent. It is no accident that 
Walras and Marshall paid so little attention to the firm and so 
much to the industry. For under the purest conditions of competi- 
tion the boundaries of the former become vague and ill-defined, and 
also unimportant, since through reactions to prices the factors of 
production adjust themselves in the right proportions and in the 
right total amounts for the industry. 

Perhaps to a greater extent than in the case of increasing cost 
will the industry be subject to oscillation around its equilibrium. 
However, once displaced, output will tend to return to the correct 
value, so that the equilibrium may be said to be stable even though 
it would appear that a limitingly small individual would have no 
incentive to keep his output unchanged. However, the same is 
true of the situation where the price of the same good in two 
markets just balances the transportation costs necessary to transfer 
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a unit of the good from one market to the other. Any one arbi- 
trager has no special inducement to ship more or less than he is 
actually doing, nor even to ship just the amount that he is doing. 
Yet the equilibrium is stable; for if the right amount were not to 
flow, the spatial price differential would change so as to return the 
system to its previous position. To an infinitely near-sighted olive 
the bottom of the cocktail glass appears level, and it no doubt 
regards itself as being in neutral equilibrium. Actually, the equi- 
librium is stable as any finite movement will show. 

Discontinuous Case 

In the general case where the production function is not neces- 
sarily differentiable we have still for a maximum 

Air S 0, for AVi § 0. (99) 

As a special case of this, for a movement of one factor, all others 
constant, we must have 


AR Ax 

Ax A Vi W " 

for 

AVi > 0, Avj — 0, 

(100) 

AR Ax 

Ax A Vi ^ W " 

for 

A Vi < 0, Avj = 0. 

(101) 


That is, it must pay to move neither backward nor forward. 

Moreover, consider a given set of factor prices (wi°, • • • , w n °) . 
Corresponding to this there is a set of factors (V, • • • , v n °) which 
give a maximum profit. In order that this be a real maximum 

•K[p(V, • • V)] — A — jt V>iW = i?[<p(»I° t • • •> VO] 

1 

-A-i w<V. (102) 

i 

Consider a set of prices ( w x \ • • • , » B l ) for which this (arbitrary) 
point (V, • • • , V) is the point of maximum profits. Then 

£[>(V, - - -, V)] -A-Z WiV Si -K[>(V, • • •, V)] 

1 

- A — Z (103) 

i 
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Now adding (102) and (103) and canceling terms we get 

Z (w* 1 - - Vi°) si 0, (104) 

1 

or 

E Aw t Avi S 0. (105) 

1 

For only the jth price varying, this becomes 

AwjAvj Si 0. (106) 

In other words, a decrease in a price cannot result in a decrease in 

the factor used. Further possible interpretations are of course 
possible. r 

As before, the general case is simpler than that of the special 
continuous case. Moreover, the method of finite increments ap- 
pears to be mathematically simpler in the sense that it is possible 
to state the qualitative direction of changes without solving in- 
versely for the actual demand functions. 

The method employed here is that which underlies Le Chatelier’s 
principle in physics. By making use of Professor E. B. Wilson’s 
suggestion that this is essentially a mathematical theorem ap- 
plicable to economics, it has been possible to gain increased gener- 
ality without increased complexity and emptiness. 

It is important to realize just how much content there is to a 
particular economic theory. As far as the single firm is concerned, 
everything fundamental which can be said is implied in the state- 
ment that in equilibrium there must exist no movement by which 
the firm can improve its profits; i.e., Ai s 0 for all movements of 
variables possible to the firm. In the case of continuity certain 
necessary relations of differential coefficients (marginal equiva- 
lences) are implied. Moreover, assuming certain specific forms to 
our functions (independence of prices, etc.), it is possible to deduce 
formally the implications of an equilibrium position (e.g., negative 
demand curves, positive supply curves, etc.). It appears that no 
more than this can be validly stated. 

External Conditions of Equilibrium 

Thus far we have been discussing the conditions of equilibrium 
imposed from within the enterprise by its desire to maximize 
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profits. This has resulted in certain marginal inequalities. Econ- 
omists have not stopped here, but have also tried to analyze certain 
conditions of equilibrium resulting from inter-competition among 
firms. That is to say, they have tried to state conditions upon 
the market situation (obstacles) with which each firm will be con- 
fronted. In particular, they have been interested in the determina- 
tion of the rate of profit which any firm can earn. 

It has often been argued that not only must price (average 
revenue) under “perfect” competition equal marginal cost, but also 
it must be equal to average cost so that net revenue will be zero. 
This second condition has not always been recognized as being of 
an entirely different nature from the first. In this section an 
attempt will be made to distinguish between them. It is hoped 
that in so doing it will be possible to put the famous “adding up” 
problem and homogeneity of the production function in its proper 
place. 

In the beginning, to avoid confusion, no use is made of the 
term “perfect” competition. The term “pure” competition will be 
understood to mean that the demand curve for any producer is 
infinitely elastic, that his sales cannot affect prices. The problem 
of discontinuity is ignored. Under these conditions the internal 
conditions of equilibrium are that marginal cost be equal to mar- 
ginal revenue (price), and hence, that the marginal physical produc- 
tivity of each factor times the sales price of the good be equal to 
the price of the respective factor. 


dR d£ 

dx ~ P ~ dx ’ 
dC 

Wi = fa ** = pcpi ~ 


(107) 

(108) 


These are marginal conditions and say nothing about the totals 
involved. It is also true by definition of the “long-run” as that 
period in which all costs can be avoided by going out of business, 
that the firm must never have a negative net revenue. As a 
condition internally imposed we know that 

TTgO, (109) 

or 

R(x) m E WiVi. 

i 


( 110 ) 
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Some writers, by a curious play on words, have been able to 
arrive at the condition that average cost equals price. A typical 
form of the argument is as follows: (1) a firm will equate marginal 
cost to price ; (2) it will also try to minimize its unit cost ; (3) at the 
point of minimum unit cost average cost equals marginal cost; 
(4) hence, average cost must equal price (average revenue) and 
profits will be zero. 

Stated explicitly, it is obvious that the second statement is false. 
The play on words arises from the confusion of the condition that 
for each and every output total and unit costs must be a minimum 
with the statement that of all outputs possible that one is chosen 
at which unit cost is the lowest. The first implies the valid condi- 
tion that the marginal productivity of the last dollar irr every use 
must be equal. The second implies the invalid (from internal con- 
siderations) condition that output be determined irrespective of the 
sales price. 

On the other hand, some have tried to argue as follows: (1) the 
production function from the nature of things must be homogene- 
ous of the first order; (2) by Euler’s theorem it follows that if 
factors are paid “according to the marginal productivity principle,” 
product will be exhausted. 

As an example of the lack of integration between the theory of 
production and that of cost, we find many writers asking whether 
product will be exhausted at the same time that they have already 
agreed that price equals average cost, and total revenue equals 
total cost. Of course, the latter condition is merely another way 
of stating the former. 

Once the problem is properly stated as that of determining the 
relation between gross revenue per unit and expenditure per unit, 
it should be reasonably clear that this cannot be determined by 
the properties of the production function alone, but must depend 
upon the marketing situation of the firm, which in turn depends 
upon the competition of other firms. It is quite clear that as far 
as the single firm is concerned it is possible that it be making huge 
profits regardless of the homogeneity of the production function. 
This condition is neither necessary nor sufficient to the exhaustion 
of the product. If the production function were homogeneous, but 
demand were sufficiently favorable, of course product would not 
be exhausted — even under pure competition. 
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The problem of homogeneity of the production function is one 
about which much controversy has raged. It has long been held 
on philosophical grounds that product must be a homogeneous 
function of the first order of all the variables, and that if this is not 
so, it must be either because of “indivisibility” or because not all 
“factors” have been taken into account. With regard to the first 
point, it is clear that labeling the absence of homogeneity as due 
to indivisibility changes nothing and merely affirms by the implica- 
tion that “indivisibility” does exist, the absence of homogeneity. 

With respect to the second point, we may reverse the Aristo- 
telian dictum and affirm that anything which must be true self- 
evidently (“philosophically”), intuitively — i.e., by conventional 
definition of the terms involved — that such a principle can have 
no empirical content. It is a scientifically meaningless assertion 
that doubling all factors must double product. This is so not 
because we do not have the power to perform such an experiment ; 
such an objection is of course irrelevant. Rather the statement 
is meaningless because it could never be refuted, in the sense that 
no hypothetically conceivable experiment could ever controvert 
the principle enunciated. This is so because if product did not 
double, one could always conclude that some factor was “scarce.” 18 

It is useful, I believe, to avoid the expression “factor of pro- 
duction” entirely. This has been used in at least two senses, 
neither of which is quite satisfactory. First, it has been used to 
denote broad composite quantities such as “labor, land, and capi- 
tal.” On the other hand, it has been used to denote any aspect 
of the environment which has any influence on production. I sug- 
gest that only “inputs” be explicitly included in the production 
function, and that this term be confined to denote measurable 
quantitative economic goods or services. The production function 
must be associated with a particular institution (accounting, 
decision-making unit, etc.), and must be drawn up as of any unique 
circumstances pertaining to this unit. Other definitions are of 

13 Any function whatsoever in n variables may be regarded as a subset of a larger 
function in more than n variables which is homogeneous of the first order. It is because 
this is true of any arbitrary function that this generalization is useless. For example, 
the volume of a sphere not being a homogeneous function of the radius, a new factor can 
be defined whose '‘scarcity” will explain this fact. Like a false proposition in logic from 
which every proposition can be derived, this overgenerality renders such a convention 
useless. 
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course possible, but it is clear that our previous conditions cannot 
be expressed in terms of them. 

So defined, the production function need not be homogeneous 
of the first order. If really homogeneous, marginal costs would 
always be constant . 14 It is indicative of the lack of integration 
mentioned above that many writers assume U-shaped cost curves 
in the same breath with homogeneity of the production function. 

In reality, it is not on philosophical grounds that economists 
have wished to assume homogeneity, but rather because they were 
afraid that, if they did not do so, contradictions would emerge to 
vitiate the marginal productivity theory. This is simply a mis- 
conception as will be indicated below. 

Our discussion can be confined to the relation of total cost and 
total revenue. The implications for marginal productivities can 
then be indicated. 

It is clear that the firm does not of its own volition act so as to 
equate average gross revenue to average expenditure, although it 
may in the long-run prevent average revenue from being less than 
average cost by going out of business. 

It is only through the competition of new firms that the demand 
curve of the firm may so shift downward as to make the position of 
maximum profit one at which total gross revenue equals total 
expenditure. 

Reserving for later investigation the conditions under which 
the demand curve will so shift, let us investigate the implications 
of the assumption of zero net revenue. Given 


8R = dC 
dx dx ’ 


( 111 ) 


and 

x = R - C = 0, (112) 

it follows that the demand curve must be tangential to the total 
unit cost curve. 

In the case of pure competition the demand curve is a hori- 
zontal line, and under the conventional assumptions as to the shape 
of the cost curve, the tangency will be at a unique point, that of 

14 It follows from Euler’s generalized theorem on homogeneous functions that the 
Hessian of a homogeneous function of the first order is identically zero. Obviously, 
stable equilibrium for a firm under pure competition is impossible in these circumstances. 
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minimum average cost. This follows from the fact that average 
cost must be equal to marginal cost, and that the latter is rising. 



dR dC C . . 

dx ~ P ~ Ihc ~ x = mmirnum average cost. 10 

(113) 

Of course, 



TO 

px = J2 WiVi by assumption, 

i 

(114) 

and 

dC 

Wi ~ The <pi ~ 

(115) 

Hence, 

* 



TO 

* = E <piVi. 

1 

(116) 


This looks superficially like Euler’s theorem, but it is not. For 
Euler’s theorem is an identity and should be written 


n 

X = 2 
1 


(117) 


whereas this is merely a condition of equilibrium holding at a 
single output. 

Moreover, in the case that the demand curve may be negatively 
inclined, we get the more general formulation 


dJR 

Wi. (118) 

Pi 

This formulation differs from that of Walras, Wicksell, and 
Hicks in that the condition of minimum unit cost is derived as a 
theorem from the condition that total revenue equal total expendi- 


15 At a minimum of average cost 
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ture. It is this last condition and the forces which lead to it that 
are of importance, and not the question of homogeneity at all. 

It is quite clear that in the real world net revenue is not zero 
for all firms, nor is it tending towards zero. This is true under 
pure competition as well as impure competition. It is clear that 
this residuum must be “due” to something, and it may be labeled 
by any name we please (rent to institutional advantage, etc.). 

The existence of this residuum does not imply any indeter- 
minacy whatsoever. Optimal output, revenue, expenditure, and 
the difference between these two terms are all fully determined. 
Of course, under ideal conditions this residuum will be capitalized 
by going concern valuations. Economists, remembering the class- 
ical solution of the problem of distribution in which the shares of 
two or more factors were simultaneously residually determined, 
have swung too far in the opposite direction. The attempt to 
“explain” all residua in terms of marginal productivity analysis 
applied to a wider production function can always be done by 
convention, but is devoid of empirical content. 

It is convenient to have an analytic definition for the case in 
which “competitive” conditions between firms are such that the 
demand curve of any firm will always shift until net revenue is 
equal to zero. The term “free entry” may be defined as the con- 
dition under which this holds. Of course, this classification cuts 
across that of pure or impure competition. Thus defined, free 
entry is a condition to be looked for empirically, rather than one 
imposed upon the data a priori. 

I suspect that part of the economists’ intuitive desire to define 
a category of “profits” as distinct from “rent to institutional ad- 
vantage” stems from a subconscious remembrance of the old- 
fashioned distinction between “natural” and “contrived” scarcity. 
Perhaps too little is heard today of this distinction, which has 
important connotations for social policy and welfare economics. 

Summary 

In conclusion, a summary formulation of the analysis is pre- 
sented to replace the famous three Theorems of Walras. These, 
aside from being redundant and ambiguous, are not each of the 
same order of meaning. The formulation here is for the continuous 
case and in terms of marginal equivalences, but the more general 
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formulation in terms of marginal inequalities readily suggests itself. 
Throughout a firm with given production function, factor prices, 
and demand conditions is posited. The general case of pure or 
impure competition is considered. 

I. The first fundamental assumption is that the firm tries to 
maximize its profits, and from this the following internal conditions 
of equilibrium can be deduced. 

A. Any output which is produced must be produced with factor 
combinations such that total cost is a minimum. As a result of 
this we have two corollaries. 

1. The marginal productivity of the last dollar must be equal 
in every us§. 

2. The price of each factor of production must be proportional 
to marginal physical productivity, the factor of proportionality 
being marginal cost. 

B. That output will be selected which maximizes net revenue, 
total cost being optimally determined by the previous conditions. 
This implies 

1. The equality of marginal cost and marginal revenue, the 
slope of the latter being the smaller. 

2. In combination with the previous conditions under A we 
also have the marginal value productivity of each factor equal to 
its price, the first term being defined as marginal revenue times 
marginal physical productivity. 

3. Total cost must not exceed total revenue, since otherwise 
the firm would go out of business. 16 

II. If we impose by arbitrary assumption or hypothesis the 
external conditions that entry be free, i.e., that total revenue be 
equal to total cost, then . 

A. Product will be exhausted by definition. 

B. The demand curve must be tangent to the unit cost curve. 
In the case of pure competition this implies minimum average cost. 

16 If the enterprise under consideration owns productive resources which have a sale 
value on the market, it is necessary that net revenue be at least as large as the sale 
(liquidation) value of these resources. As an internal condition of equilibrium w sale 
value of owned resources. Alternative uses elsewhere introduce “opportunity costs.” 
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Aside from the above general conditions of equilibrium, it has 
been shown how the definition of an extremum position may be 
utilized (a) to evaluate the direction of change of variables with 
respect to parameters (prices) taken as data, regardless of condi- 
tions of continuity, and (b) to develop reciprocal relations imposed 
upon demand derivatives, where these exist. 



CHAPTER V 


THE PURE THEORY OF CONSUMER’S BEHAVIOR 

If one were looking for a single criterion by which to distinguish 
modem economic theory from its classical precursors, he would 
probably decide that this is to be found in the introduction of the 
so-called subjective theory of value into economic theory. This 
revolution in thought broke out almost simultaneously along three 
fronts, and with it we associate the names of Jevons, Menger, 
and Walras. 

Moreover, it is this part of economic doctrine which has proved 
to be the center of so much controversy. Indeed, many critics of 
the orthodox tradition have identified the whole body of economic 
theory with the belief in that abstraction, homo economicus. In 
fact, many economists, well within the academic *f old, would sepa- 
rate economics from sociology upon the basis of rational or irra- 
tional behavior, where these terms are defined in the penumbra of 
utility theory. It would seem extremely important, therefore, to 
know clearly what is contained in the conventional utility analysis, 
if only to understand the consequences of denying its validity. 

Evolution of Utility Concept 

The concept of utility may be said to have been undergoing 
throughout its entire history a purging out of objectionable, and 
sometimes unnecessary, connotations. The result has been a much 
less objectionable doctrine, but also a less interesting one. With- 
out doing justice to the subject, these developments may be sum- 
marized in a brief way. It must be clearly understood, however, 
that these are the movements of the pioneers of thought. Their 
work appears chiefly in academic journals, and has little influenced 
the general class of economists. 

(a) One clearly delineated drift in the literature has been a 
steady tendency towards the rejection of utilitarian, ethical, and 
welfare connotations of the Bentham, Sidgwick, Edgeworth va- 
riety. These matters still receive consideration in questions of 
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normative policy, but they are clearly separated from the problem 
of consumer’s behavior. Although especially marked in regard to 
inter-individual welfare comparisons, there is the same tendency 
in connection with the analysis of the behavior of a single indi- 
vidual. Only as obiter dicta do we find in the modem literature 
discussions of particular pleasures as being pure or impure, etc . 1 

(b) Concomitantly, there has been a shift in emphasis away 
from the physiological and psychological hedonistic, introspective 
aspects of utility. Originally great importance was attached to 
the ability of goods to fill basic biological needs; but in almost 
every case this view has undergone extreme modification. At the 
same time, there has been a similar movement away from the con- 
cept of utility as a sensation, as an introspective magnitude. It is 
not merely that the modern economist replaces experienced sensa- 
tion or satisfaction with anticipated sensation, desire, according to 
the now familiar distinction between ex post and ex ante analysis. 
But much more than this, many writers have ceased to believe in 
the existence of .any introspective magnitude or quantity of a 
cardinal, numerical kind. With this skepticism has come the recog- 
nition that a cardinal measure of utility is in any case unnecessary; 
that only an ordinal preference, involving "more” or “less” but not 
“how much,” is required for the analysis of consumer’s behavior. 

Indeed, so far has the reaction gone that it is the belief of many 
that nothing remains but an empty convention. Others, who do 
not admit the hollowness of utility, have in some cases embraced 
a formulation of the analysis which is meaningless in any opera- 
tional, empirical sense . 2 The result is a curious jargon of dogmatic 
precepts. 

Thus, the consumer’s market behavior is explained in terms of 
preferences, which are in turn defined only by behavior. The re- 
sult can very easily be circular, and in many formulations un- 
doubtedly is. Often nothing more is stated than the conclusion 
that people behave as they behave, a theorem which has no em- 

x The Cambridge tradition is perhaps an exception in this respect, although even 
here the change in emphasis is notable. 

2 Cf. Alan R. Sweezy, “The Interpretation of Subjective Value Theory in the 
Writings of the Austrian Economists,” Review of Economic Studies , vol. I, no. 3 (1934), 
pp. 176-185. 
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pirical implications, since it contains no hypothesis and is consistent 
with all conceivable behavior, while refutable by none . 3 

Nevertheless, as we shall see, the modem utility theory with all 
its qualifications is not in a technical sense meaningless. It is a 
hypothesis which places definite restrictions upon demand func- 
tions and price-quantity data; these could be refuted or verified 
under ideal observational conditions. One should have thought 
that these empirical implications would have been the sole end of 
the theorists who have concerned themselves with these matters. 
Strangely enough, means and ends have been so confused that only 
a small fraction of the literature has been concerned with this 
problem even indirectly; moreover, in this there are scarcely half 
a dozen papers in which valid demand restrictions have been 
developed. 

I do not propose to defend the fruitfulness of these empirical 
restrictions. The extent to which they satisfy and unify the fac- 
tual behavior of consumers cannot be settled by argumentation. 
However, for better or worse the theory of utility has occupied an 
important position in economic thought for the last half century. 
This alone makes it desirable that its meaning be clearly understood. 

Progression in Mathematical Thought 

From the beginning mathematical methods have figured promi- 
nently in the analysis of utility. Despite the unfavorable reaction 
produced among some writers who felt that a spurious precision 
was implied by the use of these supposedly “exact” tools, it is 
demonstrable from the literature that symbolic methods have been 
an aid to clear thinking and the advancement of the analysis . 4 
For those who used this abstract language were forced to formulate 
their concepts unambiguously, and so the way was opened for 
modification and qualification. 

It is interesting, therefore, to review very briefly the history of 
some mathematical aspects of the theory to bring out clearly the 
progression in thought through time. 

3 Still another “meaningless” theory is held by those writers who speak of behavior 
in terms of the economic principle , regardless of whether any empirical behavior related 
to it exists. 

4 Thus, Edgeworth's Mathematical Psychics offers penetrating insight into the views 
commonly held in his day. 
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As early as 1854, Gossen is credited with presenting what is 
essentially marginal utility. He assumed this to be a decreasing 
linear function of the quantity of any particular good. The utility 
function would therefore be as follows: 

U = K + (ai*i — biXi 2 ) + (© 2*2 — b 2 x 2 2 ) '+•••. (1) 

Jevons, writing fifteen years later, proposed that the utility 
function be written as the sum of utilities pertaining to each good 
separately. 

u = Fx(xi) + V 2 (x 2 ) + • • ■ + Vn{Xn), (2) 

where the functions F* obey the law of diminishing marginal 
utility. 5 Specifically, ' 

V/( Xi ) > 0, (3) 

< 0 . 

In his Mathematical Psychics (1881), Edgeworth, going further 
than Jevons, suggested that the requirement that utility be a sum 
of functions pertaining to each good was an unnecessary and indeed 
unjustifiable assumption. He proposed, therefore, that the utility 
function be written in the form 6 

U — <p(Xi, ' , X n )i (4) 

where <p is any joint function of the quantities of all goods, and 
where 

= alfe $ °- * j) (5) 

By the end of the nineteenth century many writers, notably 
Pareto, had come to the realization that it was an unnecessary and 
unwarranted assumption that there even exist utility as a cardinal 
magnitude. Since only more or less comparisons are needed for 
consumer’s behavior and not comparisons of how much more or 
less, it is only necessary that there exist an ordinal preference field. 

5 Walras and Marshall also made the assumption that utility may be written as 
in (2). In the case of Marshall, as will be discussed later, it is not clear whether he 
really intended to be taken literally when making the assumption that utilities are inde- 
pendent, or whether he regarded this as an approximation for small mdvements under 
certain conditions. 

6 Professor Irving Fisher is also credited with the independent discovery of this 
possibility at a later date. 
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For any two combinations of goods, respectively ( Xi ° , • • • , x n °) and 
(pci 1 , • • Xn 1 ), or for brevity, (X°) and ( X x ), it is only necessary 

that the consumer be able to place them in one of the following 
mutually exclusive categories. 

a. (X°) preferred to (X 1 ) 

b. (X 1 ) preferred to (X°) (6) 

c. (X°) and (X 1 ) equally preferred or indifferent. 

For convenience, we may attach a number to each combination; 
this is assumed to be a continuous differentiable function. This 
function (or rule of numbering) may be written 

, <P - <p{X) = <p(x x, (7) 

It is so constructed that the following three conditions correspond 
to the above three respectively: 

a'. ^(X 1 ) < <p(X°) 

b'. <p(X°) < <p{X x ) (8) 

c'. <p(X°) = <p{X l ). 

<P may be designated as a utility index. The one parameter 
family of loci defined by 

c^(vi, * , Xit) “ C, 

where C is regarded as a parameter, are designated as indifference 
loci. 

It is clear that any function 

U = F(<p), F'(<p) >0 (9) 

defined by any monotonic transformation of ip, is also a utility 
index. For 

KX 1 ) $ <p(X°) implies U(X X ) $ C7(X°), respectively. 

The converse also holds. Thus, from any one utility index all 
others can be derived by a suitable functional transformation. 

To summarize, our ordinal preference field may be written 

U = Fl<p(x h ■■■,x n )J F'(ip) > 0, (10) 

where <p is any one cardinal index of utility. 

It is clear that the choice of any one numbering system or 
utility index is arbitrary. The indifference loci are left unchanged 
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by any alteration of the tags attached to each, provided ordinal 
relationships are maintained. In order, therefore, to avoid the 
asymmetry of employing any one favored utility index, many 
writers (Pareto, W. E. Johnson, Hicks and Allen, et al .) have sug- 
gested that a notation be employed which is dependent only upon 
the invariant elements of the ordinal preference field, namely, the 
indifference loci. 

The direction cosines of the tangent plane to an indifference 
locus at any point must be in determined ratios. Given any one 
utility index, we have 

\ 0 X 2 ) U=°C \OXzJv=C \dx n /u=c 

as 

F'<Pi:F'<P 2- • • ■ :F'<p n . 

We may take as given the invariant slope functions 

~ (axOtr-e = 1R '( Xl> ' ' x »)- (* = 2, • * •, n) 

These are invariant under any change of utility index, for 

IDj Uj(jX 1, ' i 

~ Ui(xi, •••,*») 


However, if we consider more than two commodities, the func- 
tions 1 R i cannot be all arbitrarily chosen. In order that there 
exist an ordinal preference field of the type described above they 
must, as Professor Fisher has pointed out, satisfy the following 
integrability conditions: 

l R/ - *Ri 'Rf = % Rx\ (14) 

so that the following so-called Pfaffian 

dxi 4- Wft + • • • 4- 'R n dx n (15) 

admits of an integrating factor y{xi, • • •, x n ), and may be con- 
verted into the exact differential 

dtp — ydx 1 + {y 1 R?)dxi +•••■+ ( y 1 R n ')dx n 
= tpidxi + cpzdxz + * * • + <p n dx„. 


F tpi^Xij * * *, Xn) 


F'<p x(*i, 

<Pt(Xi, ■ 


* t Xn) 
‘ j Xn) 


<Pl(Xi, ■ • * , x n ) 


(*« 2, •..,») (13) 


( 11 ) 


( 12 ) 


( 16 ) 
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<P i = Ti 

and 

= (T 1 ^). 7 (17) 

From this stage it was but one small step to the rejection of 
the integrability conditions. Thus, Pareto, Hicks and Allen, and 
others have been content to start out with the assumption of a 
planar element embodying indifference directions at each point. 
The latter two writers call these the respective marginal rates of 
substitution between the ith and first goods. These are written 
as in (12), but the functions are not required to satisfy the partial 
differential equations of integrability presented in (14). 

1 i? i = ' •»*»)• (i = 2 ,•••,») (18) 

The Demand Functions as a Goal 

We have seen an account of the transformations which the 
preference field has undergone through time. However, nothing 
has been said as yet as to the use to which these concepts are put 
in the explanation of consumer’s behavior. This we must now do 
in order to investigate the meaning — in the technical operational 
sense — of the various hypotheses. 

Following traditional assumptions of the pure theory of con- 
sumer’s behavior, we consider a single idealized consumer buying 
goods and services per unit time in a market whose prices he cannot 
appreciably affect. The selling of personal goods and services may 
be at times regarded as negative purchases. For present purposes 
each good and service is taken as clearly defined, homogeneous, 
divisible, etc. Let us designate all goods and services (xi, • • • , x») 
with respective given prices (pi, ••• , p„ ). Total expenditure or 
income is defined as 

n 

7 “ X ' \p 1 “f“ %2p2 i * * * T" %nftn “ pi%i* (19) 

1 

As of any given total expenditure and a given set of prices, it is 
assumed that our idealized individual will select some determined 
amounts of each and every good. (Of course, the amount of some 
goods may be zero.) That is to say, the quantity of each good is 

7 Of course, is also an integrating factor. 
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a function of all prices and income. 

*1 = h KPl> " * *» Pni I) 

^ t = h\p h ■■•,p n ,r) 

: : (20) 

%n “ h n fp\, ' ' ’ , pnt 2 )* 

These are the general demand functions. Their derivation is the 
whole end and purpose of our analysis of consumer’s behavior. As 
has been reiterated again and again, the utility analysis is meaning- 
ful only to the extent that it places hypothetical restrictions upon 
these demand functions. This is the point of view from which we 
shall proceed. 

The Marshallian partial equilibrium demand functions for the 
first good would be, of course, 

xi = h l (p u pi, • • • , p n , /) = D l (p x ), (21) 

where all other prices and income are held constant by ceteris 
paribus assumptions. A meaningful restriction upon our price- 
quantity data would be the hypothesis that an increase in one 
good’s price will, ceteris paribus, result in a decrease in its quan- 
tity; i.e., 

w. - h ‘‘ < °- (22) 

Is this derivable from the utility analysis? Can anything be said 
about dxi/dp;, the change in the quantity of one good when some 
other price varies? What about dXi/dl, the rate of change in the 
quantity of the ith. good with respect to a change in income? 
These are the questions whose answers we must seek. 

Conditions of Equilibrium 

It is not necessary that the demand curves be derived for each 
of the preference fields defined by (1), (2), (4), and (10), respec- 
tively. Fortunately, the last includes all previous formulations 
as special cases. I start out from the general case of an ordinal 
preference field, later considering the meanings of the special cases. 

The utility analysis rests on the fundamental assumption, that the 
individual confronted with given prices and confined to a given total 
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expenditure selects that combination of goods which is highest on his 
preference scale. This does not require (a) that the individual 
behave rationally in any other sense ; (b) that he be deliberate and 
self-conscious in his purchasing; (c) that there exist any intensive 
magnitude which he feels or consults. 

Our problem, therefore, is the comparatively simple one of 
finding a maximum for 

U = Fl<p{x h ( 10 ) 

subject to 

£ PiXi = I, (19) 

1 

where {pi, • , p n , I) are each preassigned parameters. 

This is a constrained maximum problem, since equation (19), 
familiarly termed the budget equation, must be satisfied. This 
restricts the choice of quantities. Without such a restriction the 
individual could presumably purchase an unlimited amount of 
goods up to the point of satiation. But in point of fact, goods are 
not all free; with a fixed income the more of one good which is 
bought, the less must be consumed of another. 

In the Appendix it is shown that we must have as a necessary 
condition for such a constrained relative maximum : 


Ui + ~kpi = 0, (i = 1, n) (23) 


where X is a so-called Lagrangean undetermined multiplier. This 
may also be rewritten in either of the two following equivalent 


forms : 

Ui 

pi 


Ui " 

' Pi 

or 

Ui 

L h 


pi 

P 2 


= 2, • • •, n) 

(24) 

A 

i 

II 

■ 

(25) 


This means that in equilibrium the ratio of the marginal utilities 
of two goods is equal to the ratio of their prices, i.e., marginal 
utilities are proportional to prices. 

It is clear from the formulation of (24) that it does not matter 
which utility index we use, for 


Ui = F'<pi. (*■« 1 , 


(26) 



Therefore, 
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( 27 ) 


Our conditions of equilibrium yield the same solution, therefore, re- 
gardless of our choice of a particular utility index. It is as meaning- 
less to argue that one particular utility index is really the true 
measure of utility as it is to argue that the earth really revolves 
about the sun and not vice versa. Only in terms of observations 
other than those envisaged in our market place can a cardinal 
utility magnitude be defined. 

The formulation of (25) gives rise to the familiar interpretation 
that in a maximum position the marginal utilities of thp last dollars 
spent for each and all commodities must be identical. This mag- 
nitude ( = — X) has been termed the marginal utility of money, 
or better still, the marginal utility of income. It will be noted that 
it is not invariant under a change of the utility index, and so for 
an ordinal field no significance attaches to its magnitude, nor to 
the rates of changes of its magnitude with respect to any variables. 

Employing the notation of indifference loci, the same conditions 
may be derived. For from (13) 


Ufah - • Xn) 

U\(X\, ' ‘ Xn) 


(i = 2, - • - , n) 


Therefore, the conditions of equilibrium of (24) may be written 

Wfa, •••,*.) - & = 0. (*- 2, •••,») (28) 

This is the familiar tangency of the budget plane with the indiffer- 
ence locus passing through the point of equilibrium. Figuratively, 
the consumer moves along the budget plane until he attains the 
position lying highest on his preference scale, which must in the 
continuous case be a position of tangency; for if the budget plane 
crossed the indifference locus, he could advance to a still higher 
position. 

We have stated our equations of equilibrium in several different, 
but mathematically equivalent ways. The formulation of (23) is 
one in which the symmetry of all variables is maintained, and so 
for definiteness we may concentrate upon it. Our budget equation 



IOO 


FOUNDATIONS OF ECONOMIC ANALYSIS 


of (19) must also be satisfied, and so our full conditions of equi- 
librium can be written 

Ui(xi, * , 0, (i 1, * * * , Ti) 

plXl H~ P&z + • • • + p n X n — 1=0. 

These conditions of equilibrium correspond to the set (1) of 
the second chapter. We wish to derive from them our demand 
functions, 

Xi = h%p u •••,£.,/), (30) 

which correspond to the equations (2) of the second chapter. 
Prices and income are regarded as data for this analysis, and we 
should like to know how our equilibrium quantities vary with 
changes in these parameters. 

Our conditions of equilibrium are (n 1) in number, and in- 
volve 2 (n + 1) unknowns, namely (— X, X\, • • ■ , x n , pu • • • , p n , I). 
Avoiding now all problems of multiplicities of solution, we may 
assume that ( n -f- 1) of our variables may be solved for in terms 
of the remaining ( n + 1). In particular (— A, Xi, • • - , x„) may be 
each solved for in terms of (pi, ■■■, p n , I). Hence, we get the 
following functions : 

Xi = h^pi, * * ’ , pm I), (i = 1, •••,«) (30) 

and 

(-X) =Kpx,---,p n ,I). (31) 

Thus, our demand functions can be derived from our conditions 
of equilibrium. We have also introduced a new variable (— X), 
the marginal utility of income, which could, of course, have been 
eliminated, but only with a loss of symmetry. Had we employed 
one of the other equivalent sets, such as (28), we should have had 
n equations between (2 n +1) variables, and so our n quantities 
could have been expressed as before in terms of the (n + 1) prices 
and income parameters. 

Displacement of Equilibrium 

We have counted our equations and unknowns, and found them 
to be equal in number. Subject to certain restrictions, this assures 
us that all our equilibrium variables are determined. There is a 
temptation to stop at this point and rest content with these 
achievements. 


THEORY OF CONSUMER’S BEHAVIOR IOI 

In view of all that has been said in earlier chapters, it requires 
no further argument to show that our task has hardly begun. 
There remains the sizable problem of deducing the qualitative 
properties of our demand functions from our knowledge of the 
properties of our equilibrium maximizing equations. 

To do so, we employ the same methods outlined in the second 
and third chapters. Let us write the total differential of the equi- 
librium equations (29) 


Undxi + U i^ixi -j- • • • -f- U i n dx n -j- ptd\ 

= (- \)dpi, {i = 1, 

pidx i -+- p-idxi + • ■ • + pndxn 

— dl — ( xidpi Xidpz 4 - • - + x n dp n ), 


or 


JL Ufjdxj + pid\ — (— \)dpi, (i = 1 , • • • , n) 

1 (32) 


y pjdxj dl y ' , %kdpk» 

i i 


These are (n + 1) linear equations in the (n -)- 1) unknowns 
[dxu • • •, dXn, d(— X)]. Their solution may be indicated as 
follows : 


n n 


Y, ( — \)Dijdpi-r(dI — £ Xkdpk)D n ±i,j 
1 x 


t 

II 

1 



D 

1 

f 



d(— X) = 

-[£(- 

i 

n~\- l&fi i H” (di 

n 

2D Kk&Pk) 

1 






D 




where 





Uu 

Un 

■ • • u ln 

P 1 







Un 

Utt 

• • • u in 

Pt 



D = 

U<s | 
Pi 

Pi 

0 

= 

u nl 

u n2 

U art, 

Pn 







Pi 

P 2 

••• pn 

0 



(33) 


(34) 


and Dij indicates the cofactor of the element of the ith row and the 
jth column. 
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The formulae in (33) give the changes in our unknowns for any 
changes in the parameters, prices, and income. As special cases, 
the following partial derivatives may be evaluated : 


dxj ( 1, j 

dpi D 

dXj JO n-f 1, j 

dl ~ D • 


(i,j =!,•••,«) 


where, of course, 
Also, 


dXj , . dXj 7 . 

SP, - h " w - h ' K 


d( X) £( \')Di l n + 1 XiDn+\,n-)-\\ 

dpi ~ D 


d( — X) _ — 2?n+l,n+l 

dl ~ D 


(35) 


(36) 


It is convenient to consider a compound term introduced first 
by Slutsky, defined as follows: 

K» = l % + XiJT' (*'.i = !.•••»«) (37) 

From (35) by substitution 

K h = (-X)^. (ii = l, •••,«) (38) 

Also define 

ri “ [ ~ + "dl ~ ] (- X) ’ (* = 1, •■*»«) (39) 

or 

0=1,---, (40) 

The expressions dxj/dpi, dxj/dl, K/i are all properties of the 
demand functions and are empirically determinable under ideal 
conditions. We seek restrictions on them. 

Inspection of the determinant D reveals that it is symmetrical 
with respect to i and j, since 

Uij » U n , 


h 



Hence, 
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i.e. 


k. - L-blEii _ (~ x)A-< _ ^ . 


(41) 


dXy SX;* <3x* 0Xi 


(«'» i = l, • • • , ») (42) 


What is the economic interpretation of the compound term 


Ka = 


dx,* , dXy . 

ap, + x >»t 1 


It has been called by Slutsky the residual variability of the jth good 
for a compensated change in the ith price.* 

This can be made more clear by the following considerations. 
Thus far, we have imagined the individual to be maximizing his 
utility as of given prices and total expenditure. A little thought 
will reveal that utility will be maximized as of a given expenditure 
only if the level of utility which is being realized is being achieved 
in the cheapest possible way; i.e., expenditure must be minimized 
as of any level of utility. If this were not so, the same level could 
be achieved with some money left over; this remnant could be 
spent to buy more goods, and hence a still higher level of utility 
could be attained. 

Along any indifference locus there exists for any set of prices 
an optimum set of purchases which minimizes total expenditure. 
That is, 

Xj = \p s [j> 1 , ■• •,pn, F(cp)2- (J = 1, ••*,») (43) 

For 

U = F(<p) = constant, 


we are confined to the same level of utility. It could be easily 
shown 4 that 



In words, K # is equal to the change in the quantity of the jth 
good with respect to the ith price, where the individual moves 

8 E. Slutsky, * ‘Sulla teoria del bilancio del consumatore,” Giornale degli economists , 
LI (1915), 19-23. 

9 See chap, iv, p. 9. 
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along the same indifference locus and keeps his expenditure down 
to a minimum before and after the change in price. 10 

From the formulation of our equilibrium equations in (29) we 
have seen that the properties of the demand functions are not 
affected by our choice of utility index. This may be shown ex- 
plicitly from the identities 


U = F(<p), (45) 

Ui = F'cpi, (46) 

Uu = F'va + F"v«p j, (47) 

(-X) = F'(~ V), (48) 


where ( — X') is 
index <p. Let 


From (47) 


the marginal utility of income for the utility 


D' = 


<Pij 

Pi 


Pi 

0 


(49) 


U u | 

Pi 


F'<pij + F /f (pi<pj | 

P'i 


F'vv 1 

Pi 

Pi 

0 


Pi 

0 


Pi 

0 


= (F') n D r . 


(50) 


Similarly, the following relationship holds for all cofactors. 

D {j = (F') n ~ l D'ij. (i,j = 1, •••,») 

Hence, 

(- \)D« _ (- X') D’u 
D ~ D' 

so that Kji is an invariant under any transformation of utility 
index. Literally nothing is implied for empirical price behavior by 
the choice of any particular utility index. 

Inspection of our equilibrium conditions in the form of (24) 
and (19) reveals that they are unaffected by a proportional change 
in all prices and income ; our equilibrium values remain intact for 
such a change; i.e., 

x i = h i (pu ■ ■ -,p n , F)=h i {mp 1 , ■ ■ (*=1, •••,») (53) 

where m is any positive number. Mathematically, our demand 

10 For another interpretation, see H. Schultz, , The Theory and Measurement of 
Demand (Chicago: University of Chicago Press, 1938), pp. 43-4-5. 


(51) 

(52) 
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functions must be homogeneous of order zero. Employing Euler’s 
theorem for homogeneous functions, we have 11 


dXi 

dpi 


px + 


dXj 

dpi 


p 2 + ’ * * "4“ 


dXi 

dp n 


Pn + 


dXi 

~dl 


1 = 0 . 


(i = l, ••*,?&) (54) 


Dividing through by x* yields the following relationships in terms 
of elasticity coefficients: 


Vil + + * * * + Vin + Vil ~ 0, 

where 

p j 

7,13 dpi Xi 


(i = !,••■,») (55) 


is the elasticity of the ith good with respect to the price of the 
jth good, and 

dXi I 
ViI ~ HTxi 


is the income elasticity of demand of the «th good. Intuitively, 
we should expect this equality, since an upward movement in all 
prices is equivalent to a decrease in money income. 


Thus far our analysis has not been completely devoid of mean- 
ing. The following empirical restrictions have been found to hold 
for the demand functions. 

I. They are homogeneous of order zero; i.e., a simultaneous 
doubling of all prices and income leaves all quantities demanded 
invariant. This implies as we have seen 


_L + . . . + A 

dpi Pl + dp 2 p2 + + dpn Pn 

or in elasticity terms 

Vil + Vil + • • • + Vin 


dXi 

dl 


I, (* = !,•••,») (56) 


vw, 


(i = 1, - ,n) (57) 


i.e., the sum of the elasticities of a good with respect to each and 
every price 'is equal in absolute value, but opposite in sign, to the 


11 This may also be proved by substitution from (35). 

dXi > 


r-' d%i dX% 

2^ — pj -f- — J = 
i d Pi dI 




x)£5f-o 


by a well-known theorem on determinants that the expansion of the elements of one 
column with respect to the cofactors of a different column must vanish. 
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income elasticity of demand for that good. These are n restric- 
tions which are not compatible with any and all price-quantity 
behavior, and so are meaningful. 12 

Because of this homogeneity condition, it is not necessary to 
take as independent variables the n prices and income. These 
(n + 1) variables may be reduced to n variables by considering the 
ratios between any n and the remaining variable. 

Thus, we may divide through by any price, say the price of 
the first good, to get 



•,») ( 58 ) 


This is equivalent to setting the price of the first good equal to 
unity, and using it as our numeraire. 

However, a more symmetrical measure suggests itself. Divid- 
ing through by I, we get 




,pn,r ) 

= ’ ••■’t)* (* = 1 -"-,») (59) 


12 From our definition of income or total expenditure as 

n 

I = Y, P&i 
1 

we have the following (n*f 1) restrictions on elasticities of demand 

X 

and 


where 


22 kiVij — &U 
1 


ft* « 


piXi 


is the proportion of income spent on the fth good. However, these are not meaningful 
restrictions, since they are consequences of our definition. At best, they could but 
reveal that we have not applied our defined operations with numerical accuracy. 
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Let 


Hence, 



0* = 1, •• - ,n) 


Xi = H { (a 1 , ■■ ■, a n ). 


(60) 


The as here are very natural units to employ, since they involve 
only the dimensions of the respective quantities. In words, a,- may 
be defined as the proportion of total income required to purchase 
a single unit of the jth good. 

II. We have also the following reciprocal “integrability” con- 
ditions. 


rr 1 QXj OXj , OXj i B \ /ten \ 

K-ji ~ 1 %i qj — Qp “f~ Xj Qp ~ Kij] ( 2 , J — 1, * * * » it) (61) 

i.e., the residual variability of the jth good for a compensated 
change in the fth price is identically equal to the corresponding 
term for the it h good with respect to the jth price. These are 
n(n — l)/2 independent meaningful conditions. 13 


Meaningful Theorems 

Thus far almost nothing has been said about the directions of 
change in our equilibrium quantities of goods demanded with re- 
spect to changes in prices and income. Does the utility analysis 
have nothing to say upon this question ? The answer can be sought 
along lines indicated in previous chapters. 

Before clouding the air with determinants, let us make a 
common sense appraisal of the situation to see whether we cannot 
suggest a simple answer. 

First, suppose the individual to be constrained to move along 
the same indifference locus. Let him be confronted with a set of 
prices, and attempt to attain this level of “utility” in the cheapest 

13 They hold, of course, only for the individual demand functions. Moreover, they 
reflect differential properties of our demand functions which are hard to visualize and 
hard to refute. For our empirical data consists of isolated points. These must be 
smoothed in some sense before our relations can be tested ; the smoothing, even by the 
best known statistical methods, is to a degree arbitrary, and so refutation and verification 
are difficult. 

I have tried, but thus far with no success, to deduce implications of our integrability 
conditions which can be expressed in finite form, i.e., be conceivably refutable merely 
by a finite number of point observations. 
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possible way. Consider the set of prices (pi 0 , • •• , p n ° ). There 
will correspond to this an optimal set of quantities (xi°, ■ ■ • , x n °), 
such that total expenditure is as low as possible, i.e., 

E Pi*Xi S Z pc°Xi°, (62) 

1 1 

where (X) is any other point on the locus 

FMX)2 = Aj>(X°)]. (63) 

Consider now a second set of prices, (pi, ■■ ■ , p n v ), and the 
corresponding optimal set of goods, (xi 1 , lying on the 

same indifference locus as the first. Then 

E p/xi s E PiW- (64) 

i i 


In equations (62) and (64) any values of x (along the same locus) 


may be inserted in the respective left-hand sides, 
we may write them respectively 

In particular, 

E Pi 0 Xi X E Pi°Xi°, 

1 1 

(65) 

and 


E pi l Xi° = E pxXi 1 . 

i i 

(66) 

This means that the optimal set of goods for each respective set of 
prices cannot cost more than the other set of goods (optimal for a 
different set of prices). 

Rewriting the equations, we get 

E p-i^x* - Xi 1 ) S 0, 

1 

(67) 

E pHx? — Xi°) s 0. 

1 

(68) 

Add these two equations to get 


E {pi 1 - PMxi 1 - Xi 0 ) s 0. 

(69) 




If the two equilibrium points are assumed to be always distinct, 
and if a proper absolute minimum is assumed to be always realized, 
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then the equality sign may be dropped, and this may be rewritten 

£ (Pi 1 - Pi°) (xi l - Xi °) < 0, (70) 

This may also be written 

D ApiAxi < 0, not all A pi = 0. (71) 

1 

Suppose we allow but one price to change, say the Ath; then 
all but one term of (71) vanishes and we have 

Ax k Ap h <0; (k = 1, • • •, n) (72) 

i.e., as the kth price changes , all other prices being held constant, less 
will be bought of the kth good. It must be emphasized that this 
holds only for a movement along the same indifference locus, i.e., 
for a compensated change in price, and does not mean that with a 
given money income a change in one price will necessarily result in a 
decreased amount taken of the corresponding commodity. It will 
be noted that the above proof does not involve the calculus at all ; 
using only the operations of addition and subtraction, the definition 
of a maximum position may be utilized to derive meaningful finite 
demand restrictions. 

Employing only the most elementary logical and arithmetical 
operations, we can advance matters still further. Consider any 
initial set of prices and income (pf, • ■ • , p n °, 7°). Corresponding 
to this there will exist one or more optimal sets of goods. Select 
one of these and designate it by (Xi°, • • •, x n °). Consider now a 
second set of prices and income (pf, ■ • • , pf, I 1 ), and a correspond- 
ing optimal set of goods (xd, • • • , xf). 

Let us consider what would have been the cost of the second 
batch of goods at the prices of the first. This will be 

PlW + pf'xz 1 + • • • + pJXn 1 = £ pi°X, X . (73) 

1 

If this cost is equal to or less than the amount of money that the 
first batch actually cost, we have conclusive evidence that the 
second batch is not higher on the individual’s preference scale than 
the first batch ; for if it were, the individual could not have been 
in equilibrium in the first place, since he would not be minimizing 
total expenditure for the attained level of satisfaction. In other 
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words, if he could have bought the second batch, and he bought 
the first, we rule out the possibility that he prefers the second to 
the first. 

Our theorem is 


£ PW s £ pW implies TJXX 1 )] s F[>(X 0 )]. (74) 

i x 

More specifically, 

£ PW < £ PiW implies Fl<p(X')l < T[>(X 0 )]. (75) 

i i 

Similarly, 

£ pW < £ PiW would imply T[>(X 0 )] < FJXX 1 )]. (76) 
x 1 

It is obvious that 


£ PM 1 < £ PM (77) 

i i 

and 

£ PiW < £ PW (78) 

i i 

cannot both hold simultaneously, for this would imply 

-PMX 1 )] < T[>(X 0 ):], (79) 

and 

7TXX 1 )] > FMX°n (80) 

which is a contradiction. 

This gives us a condition which holds for any movements, not 
merely for compensated ones. 

Equations (77) and (78) may be written 


£ #i°(»i 1 — Xi°) < 0 implies £ pp(x.p 


i 


i 


Xi°) < 0, 


(81) 


or 

£ piAxi < 0 implies £ (pi + A pi) Axi < 0. (82) 

1 1 

If we assume that our demand functions are single-valued, and 
agree to consider only distinct points, this may be broadened to 
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the following form : 

Z pAxi Si 0 implies Z (pi + Api)Axi < 0. (83) 

i i 

The importance of this result can hardly be overemphasized. 
In this simple formula are contained almost all the meaningful 
empirical implications of the whole pure theory of consumer’s 
choice. Moreover, these are expressed in the form which is most 
suitable for empirical verification. So fundamental is this condi- 
tion that (as I have shown elsewhere) it provides a foundation for 
the theory of economic index numbers and the utility analysis, and 
affords the most convenient path for the derivation of all known 
restrictions upon the individual and general demand functions. 14 

From this condition alone can be derived the following restric- 
tions upon the demand functions: 

(a) They must be single- valued ; i.e., to each set of prices and 
income there corresponds a unique set of goods. 

(b) They must be homogeneous of order zero; i.e., a change in 
all prices and income in the same proportion must leave all quan- 
tities unchanged. All the properties of condition I of the previous 
section must therefore hold. 

(c) All known valid qualitative restrictions upon the slopes of 
the demand functions. These will be indicated below. 

Elsewhere 15 I have suggested as new foundations for the pure 
theory of consumer’s behavior the conditions (a) and (b) and 
equations (83). At that time I did not realize that (a) and (b) 
were redundant in the sense that they themselves could be deduced 
as theorems from the assumption of (83) alone. In other words, 
this single condition provides us with complete foundations for the 
theory (with the reservation concerning integrability) . 

The proof of (a) and (b) as theorems can be indicated simul- 
taneously. Consider an initial price and income situation ( pi 0 , - • • , 
p n °, 1°). Corresponding there is a set of goods (aci°, • ■ x n °). 

Suppose now all prices and income to be multiplied by the same 

14 The only point upon which this formulation does not throw light is that of in- 
tegrability. Even here, a proof may still be forthcoming by which this condition may 
be slightly generalized to include the question of integrability. 

15 “A Note on the Pure Theory of Consumer’s Behavior,” Economica , February, 
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positive quantity, m, (mpP, ■ ■ • , mp n °, mP). Corresponding there 
is a second set of quantities (xi 1 , • • • , x,, 1 ). We wish to prove that 
the second batch of goods is identical, commodity for commodity, 
with the first. 

By hypothesis, 


Hence, 

I 1 = mP. 

(84) 


Z Ptei 1 = m Z pi°Xi 0 . 

i i 

(85) 

Also, 



Therefore, 

Pi 1 = mpi°. (i = 1, ■••,#) 

(86) 


n n 

Z Pi°Xi l = Z PPxP. 

1 1 

(87) 

But also 

Z PPxP = Z PiW. 

1 1 

(88) 

But this is a contradiction, for condition (83) says that 


Z Pi°Xi l 

1 

= Z Pi°Xi° implies Z pHP > Z pp Xi \ 

1 1 1 

(89) 

Therefore, these cannot be two distinct points. Hence, 



Xi 1 = Xi°. (?' = 

(90) 


For m = 1, condition (a) is deduced as a special case of (b). 

Thus far, we have made no use of the calculus. We may now 
proceed to derive conditions upon the various partial derivatives 
of our demand functions. 

Let us go through a limiting process and write (83) in the 
following differential form : 


n 



n 


Z pidxi = 0, 


not all dxi or dpi = 0. 
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In this expression the dx’s and the dp’s are differentials, not 
infinitesimal increments. 

Regarding prices and income as independent variables, from 
our demand functions of (20) we have 

dxi = E h/dpj + hpdl. (i = 1, • • •, n) (92) 


But for E pjdxj — 0, 

1 

dl = E pjdxj + E Xjdpj = E Xjdpj. (93) 

iii 

Therefore, 

dxi = E (V + Xjhi')dpj = E £!»';#>• (94) 

1 1 

Equation (91) becomes 

E L (^i' + XjhT)dpidpj 0, (95) 

1 1 

or 

E S {Kipdpidpj S 0. (96) 

1 1 

This is a negative semi-definite form ; semi-definite because for all 
prices changing in the same proportion, it vanishes due to the 
homogeneity condition. 

This is also derivable in at least two other ways. From 
equation (71) 

n 

E ApiAxi S. 0 along an isoquant. (97) 

; 1 

This may be written 

n 

E dpidxi < 0 along an isoquant 
1 

(not all differentials vanishing). (98) 
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But along an indifference locus, from (43) 

Xi = ■•■,p n , (i = 1, • ■ • , n) (99) 

where ip 1 is homogeneous of order zero in the p’s. Also 

dxi = E 'PMPi- (*" = 1, ■ • - ,n) (100) 

1 

Therefore, 

E E p/dpdpj g 0. (101) 

i i 

But, of course, for a movement along an indifference locus, i.e., 
for a compensated price change, 

w, " ~ K “ ■ (102) 

Therefore, (101) may be written 

E E Kfjdpidpj g 0. (103) 

i i 

Finally, in the Mathematical Appendix the following algebraic 
theorem is stated : Let 

ZAtsl = [^] = £A it J (*, j = 1, • • • , «) (104) 

be the first n by n matrix of the inverse matrix of [D~\. Then 

E E AM g 0 (105) 

i i 

because [D] is the matrix of a negative definite form under con- 
straint. Now from (38)' 

Ka = (- X)-4„. (106) 

Since (—X) >0, our theorem again follows. This is a direct 
algebraic proof of our theorem. 

The meaning of the requirement that the form in (96) be nega- 
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live semi-definite may be briefly indicated. 16 Let 

K=\K iS \ = 

Then because of the semi-definiteness 

\K\ = 0 and Z L Kijpipj = 0. (108) 

i i 


K u 

K u ■ 

Kin 

K 21 

K 2i ■ 

K^n 

K n i 

K n2 

K nn 


(107) 


However, the principal minors beginning with the first alternate in 
sign from negative to positive, i.e., 


|iTn!< 0; 


Kn K 12 
Kn K 22 


K u 

K n 

K u 

K n 

Kn 

Kn 

Kzi 

Kz2 

Kzz 


< 0, etc. 


(109) 


Thus, the following demand restrictions are implied. 


0*1 + x .^i < o 

dpi ^ dl ^ u ’ 


( dX{ . dXi \ ( dXj , dxj \ 

W i + x, 5i)w, + x, st) ~ 

(i 3 = 1. 


(i = 1, •• •, n) 
{ dXi 5 #A 2 


( 110 ) 


\dpj + % dl ) 


> o, 

, n), O' 5^ j), etc. (Ill) 


Condition (110) was first developed by W. E. Johnson and 
Eugen Slutsky, presumably independently. We see that it is not 
possible to deduce that 


dXi 

dpi 


0, 


( 112 ) 


the ordinary expression for the “law of demand.” For if dXifdl 
be sufficiently negative, dXif dpi can be algebraically positive. This 
is the phenomenon alluded to in the well-known Giffen’s Paradox. 17 

16 If integrability is not assumed and Kij Kju the exposition may be easily 
modified by the substitution throughout of the term (Ku -j- Ka)/2. 

17 It is only by making additional, and demonstrably arbitrary, assumptions that 
various writers have been able to derive the so-called law of diminishing demand. 
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The assumption that the form in (96) be symmetrical and nega- 
tive semi-definite completely exhausts the empirical implications 
of the utility analysis. All other demand restrictions can be de- 
rived as theorems from this single assumption. These are bold 
statements, but they are substantiated by the fact that it is possible 
to work backwards from the assumption of (96) to an integrable 
preference field displaying the properties necessary fora maximum. 18 

Conclusion 

We have come a long way in this chapter. Despite its lofty 
beginnings, the pure theory of consumer’s behavior, when its em- 
pirical meaning is finally distilled from it, turns out to be one simple 
hypothesis on price and quantity behavior. This may be written 


for 


Z (Pi + &Pi)AXi < 0, 

x 


Z PiAxi s 0, not all Ax* = 0, 


subject to the qualifications indicated above. Alternatively, we 
may write this as 

dXj 

IT 


» » /Sy. 

r y(— i 
\dpi 


Z £ 

i i 


+ Xj 


dpidpj s o, 


18 Only a sketch of the proof of this statement need be given. Write 
Xi = * , a n ), O' = 1 , • * *, n) 


dx% BXi f dXi 


^ dXi \ 


Define a new set of variables 

(3i = f(ct h • • • , ct n ), 


such that 


and 


dXi 


or ah = F k (fi i, - ", j5 
***,«] = u 
dXj 


* j 


m Gf as K? m Kj m Gf 
eft 


Then there exists a function 




or 


<p = <p(Xi f • • ',X n ) t 
which satisfies the properties of our preference field. 
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where the equality sign holds only for all prices changing in the 
same proportion. 

Many writers have held the utility analysis to be an integral 
and important part of economic theory. Some have even sought 
to employ its applicability as a test criterion by which economics 
might be separated from the other social sciences. Nevertheless, 
I wonder how much economic theory would be changed if either 
of the two conditions above were found to be empirically untrue. 
I suspect, very little. 

A Note on the Demand for Money 

One special problem in the theory of value has beqp touched 
upon at various places in the previous discussions, namely, the 
value of money itself. Probably more has been written upon this 
subject than upon any other in economics, and most of the issues 
raised are not germane to the present investigation. However, it 
is a fair question as to the relationship between the demand for 
money and the ordinal preference fields met in utility theory. In 
this connection, I have reference to none of the tenuous concepts 
of money, as a numeraire commodity, or as a composite commodity, 
but to money proper, the distinguishing features of which are its 
indirect usefulness, not for its own sake but for what it can buy, 
its conventional acceptability, its not being “used up” by use, 
etc., etc. 

The most interesting problems arising in connection with money 
are linked up with the fact of “uncertainty” in the most general 
sense, leading into liquidity considerations which cannot be dis- 
cussed here. However, it should be possible to indicate in a few 
pages the way out of certain false dilemmas connected with the 
demand for money for so-called transactions purposes . 19 

On the one hand, there are writers like Mises who would explain 
the value of money in marginal utility terms such as might be 
applicable to any commodity; on the other hand, to economists 
like Schumpeter the peculiarity of money lies in its essential lack of 
direct usefulness and in the fact that its value is not capable of 
explanation in the usual utility terms. On the whole the latter 

19 For a summary of the extensive Continental speculations on these matters see 
Howard S. Ellis, German Monetary Theory , 1905-1933 (Cambridge, Mass., 1934), Part I. 



118 FOUNDATIONS OF ECONOMIC ANALYSIS 

view is the least misleading, but, as Walras showed many years 
ago, it is possible to modify utility analysis so as to take account 
of the peculiar properties of money. The latter, who above all 
others developed the notion of general equilibrium in which all 
magnitudes are simultaneously determined by efficacious inter- 
dependent relations, was able to remain undisturbed by the fears 
of literary writers that there was something viciously circular in 
assuming the existence of prices and of a “value for money” in the 
midst of the process by which that value was to be determined. 
Today after the recent contributions of Keynes, it is particularly 
rewarding to go back to reexamine the elaborate discussion of 
liquidity preference, encaisse desiree, etc., in Walras. The latter 
was so sophisticated as to have outgrown the quantity equation in 
later editions of his work, although he continued to believe in what 
is today called the “quantity theory”; very rightly, in my opinion, 
he in effect reversed the commonly met dictum that “the quantity 
theory should be scrapped, but the quantity equation is useful.” 

Here I shall only consider the demand for the holding of money 
by the consumer. As before, ordinal utility or preference depends 
upon all commodities, but the ( n + l)th good, M, will be taken to 
be money, which yields benefit only in its ultimately being given 
up. Possession of an average amount of it yields convenience in 
permitting the consumer to take advantage of offers of sale, in 
facilitating exchanges, in bridging the gap between receipt of in- 
come and expenditure, etc. The average balance is both used and 
at the same time not used; it revolves but is not depleted; its just 
being there to meet contingencies is valuable even if the con- 
tingencies do not materialize, ex post. Possession of this balance 
then yields a real service, which can be compared with the direct 
utilities from the consumption of sugar, tobacco, etc., in the sense 
that there is some margin at which the individual would be in- 
different between having more tobacco and less of a cash balance, 
with all of the inconvenience which the latter condition implies. 

But there is this difference. Given physical amounts of to- 
bacco, food, ballet, etc. have significance in terms of the want 
pattern of the consumer, but it is not possible to attach similar 
significance to a given number of physical units of money, say to 
a number of ounces of gold. It would be otherwise in the case of 
gold which was to be used to fill teeth, but such uses of money in 
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the industrial arts we purposely neglect. The amount of money 
which is needed depends upon the work that is to be done, which 
in turn depends upon the prices of all goods in terms of gold. 

The above remarks are by now so well known as to appear trite 
and trivial. But let us translate them into mathematics. Our 
ordinal utility is now a function, not alone of the physical quan- 
tities of goods, but it has within it prices as well. This is a serious, 
significant alteration, for, as we shall see, the empirical properties 
of the demand functions are changed by this innovation. This is 
not the only case in which economists have found it necessary to 
introduce prices into the indifference loci ; there is also the example 
of goods which have snob appeal, or scarcity appeal, which are 
valued for their exclusiveness so that preference for therfl is altered 
by changes in their relative prices. This Veblenesque effect need 
not detain us here. 

Our utility function will be of the form 

U(Xly ' , %nj hi fimj fill * , fin ) 

== U (x x, *, hi \fi m t y^fih j hfih} 

•••,£), ( 113 ) 

where the function is homogeneous of order zero in all of the prices, 
so that a doubling of all prices (including the price of gold) at the 
same time that quantities are unchanged will leave ordinal utility 
unchanged. This stems from the hypothesis, which need not be 
true in the short run or under particular expectations, that money 
is evaluated only in terms of the work which it has to do. It will 
be noted that I have not set the price of money equal to one. 
Actually, we shall avoid confusion in thought if we refrain from 
doing so. Any other commodity may be used as numeraire, or we 
may express prices in any units we like. Of course, it will still be 
possible to speak of the gold price of things, and after we have 
learned to do without the use of the money commodity gold as 
numeraire, we can then fall back upon the simple convention of 
expressing prices in terms of it. Of course no reader will think 
that I attach any particular importance to gold or any other metal; 
any conventional unit which serves as money will do. 
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Walras was careful to point out another important distinction : 
consumption of goods is a flow per unit time, so much tobacco per 
year, etc., but the gold balance is an inventory or stock. We may 
speak of its price in two senses, the price of gold compared to the 
prices of other things, as one ounce of gold is worth two beaver 
skins, while a pound of tobacco is worth three beaver skins, or an 
ounce of gold is worth two “any things,” while a pound of tobacco 
is worth three “anythings.” In our notation this is p m . But we 
may also speak of the price of the use of gold per unit time. In a 
capital market where people can both borrow and lend at a given 
rate of interest, this price is necessarily related to the rate of in- 
terest. This is true even if the individual in question does not 
have to stay in debt for the amount of his cash balance ; in any case 
there is the opportunity cost of holding money in the sense of the 
interest he might have earned by lending this sum. 

If we wish to exclude dynamical considerations from our dis- 
cussion as much as possible, the simplest assumption would seem 
to be that the individual maximizes the above expression subject 
to the following budget equation, 

Z PiXj + rp m M = I. (114) 

where prices, income, and interest, r, are given to the individual. 
Thus, the amounts paid out (or foregone) in every period for the 
use of money are treated as subtractions from income available for 
expenditure upon consumer’s goods. The conditions of equi- 
librium are exactly as in chapter v, equation (29), except that the 
marginal utilities of the goods are affected by the level of prices 
directly, and we now have an additional unknown, M, to be de- 
termined. But we also have an additional equation, 

f) TJ 

~ + \p m r = 0. (US) 

Thus, the use of money is bought up to the point where its marginal 
utility (convenience) is proportional to its cost, or the interest 
charges which must be paid for its use. The secondary inequalities 
are exactly as in the usual case, i.e., the principal minors of the 
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bordered Hessian must oscillate in sign, with money treated as an 
(n + l)th good. 

From the conditions of equilibrium our demand curves are as 
follows, 

X% h \plj * > Pny Pm, == 1 , * " * , ft) {11 

M = M(p h • • •, p n , pm, I, r). 

These equations are homogeneous of order zero in all prices and 
income just as in the usual case of demand. With the interest 
rate unchanged, doubling all prices (including the price of gold) 
and income will leave all quantities unchanged. However, if we 
double all prices and income in terms of the price of gold, then 
with r unchanged, the demand for physical quantities # of money 
will be exactly doubled. Mathematically M is a homogeneous 
function of order one in terms of all other prices and income. This 
is the peculiar feature of money. Another way of saying the same 
thing is the assertion that the demand for money in terms of its 
own price, all other prices held constant, is necessarily of unitary 
elasticity. This is a familiar proposition in the history of the 
quantity theory. The reader will note that this is a meaningful, 
refutable hypothesis which is capable of being tested under ideal 
observational conditions . 20 

The further qualitative properties of the demand schedules can 
be determined by differentiation of our equilibrium equations. 
However, they will not be so simple as those in the usual demand 
case, because in varying any one price, we shift every equilibrium 
equation through the direct influence of each price on marginal 
utilities. Still, income elasticities behave more or less the same as 
before; also the demand for money in terms of the interest rate 
behaves just like the demand for any good in terms of its own price 
under the usual utility assumptions. Thus, if interest is raised, 
and at the same time income is increased so as to leave the indi- 
vidual equally well off, then the amount of money demanded must 
fall. If money is not an inferior good, and we should expect it not 
to be, then an increase in interest will lower the amount of money 
demanded even for an uncompensated change. 

20 Jacob Marscbak, “Money Illusion and Demand Analysis,” Review oj Economic 
Statistics , February, 1943, pp. 40-48. 
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The complete results of such differentiations are given below 
in partitioned matrix form . 21 


h/ 


■hp 

• hr* 





Mj 
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M t 
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-1 

dUi , w 

dUi 
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dpi +MlJ 
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0 
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Pm 
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Xj 

rM 

• -i 

* pTnJ\/£ 


( 117 ) 


We may leave as an exercise to the interested reader to verify 
this and to work out the interesting and special case where the 
ordinal preference field takes the simple form U(x-i, • • • , x H , p m M/I). 

Qualifications Introduced by Uncertainty 

The above analysis of the demand for money confines itself to 
the so-called transaction aspects. The really interesting problems 
arise when uncertainty as well as transaction friction is admitted 
into the picture. Some of the best lines ever written in this con- 
nection are those to be found in Keynes’s General Theory and in 
Hicks’s Value and Capital. Space can be devoted to only one com- 
ment here. In the past the structure of interest rates and assets 
has been subject to only perfunctory notice so that the recent 
analysis in terms of liquidity preference is of the greatest im- 
portance. But it would be a mistake, which would be as damaging 
to further analysis of liquidity preference as it would be to classical 
doctrines, if it were thought that uncertainty and liquidity differ- 
entials are the sine qua non for the existence of a rate of interest. 

21 Reference may be made here to the valuable contribution of C. E. V. Leser, “The 
Consumer's Demand for Money,” Econometrica, vol. XI, no. 2 (April, 1943), pp. 123-140. 
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Such a view can be compared with a theory of land rent based upon 
differences in the quality of different kinds of land. I believe that 
the analogy is not a superficial one. 

Specifically I think it would be misleading to say that interest 
is simply the price of money ; it would be quite possible to have an 
economy in which money did not exist, and in which there still was 
a substantial rate of interest. 22 Nor can I agree with all of the 
formulations of Hicks’s brilliant chapter xiii on Interest and Money, 
such as . . it must be the trouble of making transactions which 
explains the short rate of interest” (p. 165);” ... if bills are 
are perfectly safe, [there would be] no reason for the bills to fall to a 
discount” (p. 165); “Securities which are not generally acceptable 
in payment of debts bear some interest because they are imper- 
fectly ‘money’ ”(p. 167). 

It is true that in a world involving no transaction friction and 
no uncertainty, there would be no reason for a spread between the 
yield on any two assets, and hence there would be no difference in 
the yield on money and on securities. Hicks concludes, therefore, 
that securities will not bear interest but will accommodate them- 
selves to the yield on money. It is equally possible and more 
illuminating to suppose that under these conditions money adjusts 
itself to the yield of securities. In fact, in such a world securities 
themselves would circulate as money and be acceptable in trans- 
actions; demand bank deposits would bear interest, just as they 
often did in this country in the period of the twenties. 23 And if 
money could not make the adjustment, as in the case of metal 
counters which Aristotle tells us are barren, it would pass out of 
use, wither away and die, become a free good. 

In the preface to the reissue of Risk, Uncertainty and Profit 24 
Frank Knight makes the penetrating observation that under the 
conditions envisaged above the velocity of circulation would be- 
come infinite and so would the price level. This is perhaps an 

22 Elsewhere I have developed this notion briefly under the title “The Rate of 
Interest under Ideal Conditions,” Quarterly Journal of Economics , LIII (February, 
1939), 286-297. 

23 In a world without uncertainty where money has the same yield as other assets, 
its velocity becomes indeterminate. Note that this is the case when the interest rate 
is zero. 

24 Risk, Uncertainty and Profit (London School of Economics and Political Science: 
Series of Reprints of Scarce Tracts, no. 16, 1933), p. xxii. 
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over-dramatic way of saying that nobody would hold money, and 
it would become a free good to go into the category of shells and 
other things which once served as money. We should expect too 
that it would not only pass out of circulation, but it would cease to 
be used as a conventional numeraire in terms of which prices are 
expressed. Interest bearing money would emerge. 

Of course, the above does not happen in real life, precisely be- 
cause uncertainty, contingency needs, non-synchronization of 
revenues and outlay, transaction frictions, etc., etc., all are with us. 
But the abstract special case analyzed above should warn us against 
the facile assumption that the average levels of the structure of 
interest rates are determined solely or primarily by these differ- 
ential factors. At times they are primary, and at other times, such 
as the twenties in this country, they may not be. As a generaliza- 
tion I should hazard the hypothesis that they are likely to be of 
great importance in an economy in which there is a “quasi-zero” 
rate of interest. I think by this hypothesis one can explain many 
of the anomalies of the United States money market in the thirties. 

A further defect of the reasoning that I have been criticizing lies 
in the tendency to regard as universal the hypothesis that interest 
yield is inversely correlated with an asset’s nearness to money so 
that normally long rates are above short rates. This is not in 
accord with much of economic history for reasons which I do not 
think are difficult to elucidate . 25 

26 See D. Durand, “Basic Yields of Corporate Bonds, 1900-1942,” Technical Paper 3 
(New York: National Bureau of Economic Research, 1942). Also F. A. Lutz, “The 
Structure of Interest Rates,” Quarterly Journal of Economics , LV (1940), 36-63. 
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TRANSFORMATIONS, COMPOSITE COMMODITIES, 
AND RATIONING 

Logarithmic Transformations and Elasticities 

Through the influence of Alfred Marshall economists have de- 
veloped a fondness for certain dimensionless expressions called 
elasticity coefficients. On the whole, it appears that their impor- 
tance is not very great except possibly as mental exercises for be- 
ginning students . 1 As we have seen, most of the “laws of eco- 
nomics” are qualitative and ordinal rather than quantitative, and 
where such are concerned the problem of dimensions is of no 
consequence. 

Furthermore, while elasticity expressions are invariant under 
changes of scale, they are not invariant under changes of origin. 
Since there are no natural zeros from which we measure economic 
magnitudes, the elasticity expressions can be seen to be essentially 
arbitrary. Thus we meet in economic analysis such concepts as 
exports, net purchases, amount of factors supplied, etc., all of 
which are differences measured from arbitrary bases . 2 

Mathematically an elasticity expression between two magni- 
tudes, such as price and quantity, consists simply of the logarithm 
of one of these quantities differentiated with respect to the log- 
arithm of the other. Thus, 

Vx P — ( dx/dp)(p/x ) = d log x/d log p = Ex/Ep. z 

1 There Is perhaps some usefulness of the concept of elasticity of demand as giving: 
an indication of the qualitative behavior of total revenue, but even this is only the 
consequence of neglecting to deal with total revenue directly. 

2 This was realized, somewhat inadequately, by Wicksteed who denied the validity 
of the concept of supply, preferring to use the notion of “reserve demand”; only he 
went to the opposite extreme in attaching sacredness to his reformulation. 

8 A calculus of operations with elasticity terms can be devised on the basis of the 
properties of logarithmic derivatives. See R. G. D. Allen, Mathematical Analysis for 
Economists (London, 1937), chap, x, particularly the reference there to the work of 
D. G. Champernowne. 
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Not only are elasticity expressions more or less useless, but in 
more complicated systems they become an actual nuisance, 4 con- 
verting symmetrical expressions into asymmetrical ones, and hiding 
the definiteness of quadratic forms. This could be illustrated by 
utility analysis, but can be amply demonstrated by the slightly 
simpler case of profit maximization, where constraints do not enter 
the picture. Let us consider a firm buying inputs (»i, - • • , v n ) in 
perfectly competitive markets at given prices (w u • • •, w n ). With 
the demand curve for its product and its production function being 
known, total revenue becomes a determinate function of the inputs 
bought. Total expenditure being defined as the sum of the 
amounts paid to all the factors of production, it is clear that the 
firm’s profit can be written in the form 


x = R(v u •••,»„)-£ WjVj. ( 1 ) 

1 

For profits to be at a regular maximum, we must have 

Ri(v I, ■ ■ •, v n ) - Wi = 0 , (i = 1 , • • • , n) 

negative definite. (2) 

Suppose we are interested in the inverse demand functions 

giving the amount of each factor of production which will be bought 
at each set of prices. By explicit differentiation of our equilibrium 
conditions above, we find 

[s£] “ <3) 

Since the inverse of a symmetrical negative definite matrix is itself 


4 Actually it is a little misleading to say that an elasticity expression is necessarily 
“without dimension.” For take any absolute derivative , such as dx/dp t which is cer- 
tainly not dimensionless, involving as it does the dimensions [output times output divided 
by value]]. Even though it has dimensions, it is still the elasticity of some expression. 
Thus if 

* = f<p), 

and 

y = -*( 2 ). 

where 
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symmetrical and negative definite, the complete conditions on the 
demand functions are summarized in the above expression. 

If one were interested in the corresponding elasticity coefficients 
[_Evi/Ewj~\, one could as an afterthought introduce suitable factors 
into the matrix of equation (3). This usual procedure has been 
termed by Professor Lange the “ indirect ” method . 5 He suggests 
an alternative “direct” method. I should like to point out a third 
method, which carries out the Lange procedure to its logical con- 
clusion. Before doing so, however, I should like to express the 
opinion that in this matter “indirectness” is a virtue rather than 
a drawback. 

It will simplify the discussion if we adopt a mnemonic notation 
whereby the Jacobian of one set of variables with respect to another 
is written in a form reminiscent of that for a single derivative; i.e., 


J(yi> • • • , y»; x h ■ ■ ■ , x n ) - £ ^ j - (4) 

The reader may then verify the identities 

dzdy _ dz 
dydx ~ dx ’ 

(5) 

dx = \dyl~ 1 

dy 


which parallel exactly those for single derivatives. 

In terms of this notation the Jacobians of the transformations 


Vi = log Vi, Vi = 

Wi = log Wi, Wi = 

e v< 

e Wi 

(i = 1 ,•••,») 

can be written as 



dV rs y ] 

dv 

“ \jP ] 

dv ~ L Vi J ’ 

dV 

dw _ r5«i 

dw j_ Wi J ’ 

dw 

dW 



( 6 ) 


( 7 ) 


where 5# is the Kronecker delta, equal to one for like subscripts 
and vanishing for all others. 

5 Oscar Lange, “Theoretical Derivation of Elasticities of Demand and Supply: The 
Direct Method,” Econometrica , X (1942), 193-214. 
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Then it follows that 


dV _ dV dv dw 
dW ~ 'fofadW 


L v 0 J 


( 8 ) 


This is the so-called indirect method. The Lange direct method 
involves computing the same matrix by the following equality 


dV 

' dWl - 1 

' VjRi, " 

dW~ 

_dV J “ 

Wi 


There seems to be no special advantage in this procedure; on 
the other hand, there is the great disadvantage of loss of symmetry 
prior to inversion, and a cloaking of the definiteness properties of 
R and its inverse. Of course, what is covered up can later be 
unscrambled, but a good deal of waste motion would seem to be 
involved. 

However, if we wish to carry the direct method to its final con- 
clusion, it would seem logical to replace the v’s and w’s in the 
original profit expression by their equivalences in terms of the 
variables V and W. With the latter being given, the firm would 
vary the F’s so as to maximize profits. The conditions of equi- 
librium would be 


and 


Si(V i, •••, F.) - gW* = 0, (i =1, • ••,#) (10) 

• [ 1 * 8 * 0 " — Id 


negative definite. In the above expressions it is to be understood 
that 

S(V U •••, V n ) (11) 

and that subscripts stand as usual for differentiation. 

If we differentiate our equilibrium conditions, we easily derive 
the identity 

( 12 ) 

where II~ 1 is symmetrical and negative definite. Thus, except for 
column factors the elasticity matrix is symmetrical. By dividing 
through with these factors, and defining Ki = vfWi/Y. v.fw h we 
easily arrive at the familiar identity 6 

R iV ij — KjTjji. (Id) 

6 H. Schultz, Theory and Measurement of Demand (Chicago, 1938), chap. xix. 
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Also we may verify the relations 


V a < 0 , 


V a Va 
Vji Vjj 


> 0, etc. 


(14) 


Relations (13) and (14) are also immediately derivable from the 
indirect method, but not easily from the Lange method. 

If one must use “dimensionless” expressions, it would seem de- 
sirable to replace the usual elasticity expression by the more sym- 
metrical coefficient 


dVi 


M iS = , . — 

dWj V vd) 


fWi 


(15) 


The reader can easily verify that symmetry and definiteness are 
preserved in all matrices, that these coefficients are pure numbers, 
which in the case of diagonal elements reduce to the old definition 
of elasticity. 

I should like to conclude this discussion of elasticity expressions 
with a warning that the transformations break down when any of 
the variables are zero or change sign. In earlier sections it has 
been shown that productive services or inputs can be regarded 
simply as negative outputs. These can be converted into log- 
arithms only after a reversal of sign. This would offer no problems 
were it not for the fact that many of the magnitudes are neither 
always positive nor always negative. As an illustration of the 
difficulties which may arise, consider the classical case where the 
total of all expenditure is taken as zero. Here the partial deriva- 
tives of the various goods with respect to hypothetical income 
changes have a definite meaning, whereas their elasticities cannot 
be defined. 


General Transformation of Independent Variables 

The logarithmic transformation discussed in the previous sec- 
tion is but one special case of the transformation of our inde- 
pendent variables. It is desirable to examine the behavior of our 
equilibrium conditions under general transformation of our inde- 
pendent variables because in economics as in other disciplines we 
wish to free ourselves from any particular scheme of reference, or 
coordinate space, in favor of generalized coordinates. This desire 
is not only aesthetic but is necessitated by the fact that in real life 
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we do not always meet naturally defined commodities. In business 
cycle theory we often use the concept of “the cost of living,” taken 
to be the price of some composite market basket of commodities. 
But even if we confine our attention to what is ordinarily called a 
commodity, such as “wheat,” we find ourselves dealing with a 
composite commodity made up of winter wheat, spring wheat, of 
varying grades. Each of these in turn is a composite of hetero- 
geneous components, and so forth in infinite regression. 

There is a related reason for analyzing the effects of a general 
transformation. Among the many recent advances contained in 
Professor J. R. Hicks’s Value and Capital perhaps the most impor- 
tant from an analytical point of view was the enunciation of the 
principle that a group of commodities has the property of a single 
commodity if their prices all change in the same proportion. This 
theorem had been partially anticipated by Professor Leontief, 7 but 
it was Professor Hicks who made it the cornerstone of his exposi- 
tion. Every mathematical economist, working in many variables, 
finds difficulty in explaining his theories in words or diagrams. By 
means of the Hicks-Leontief theorem all but one of the variables 
can be lumped together as a single variable, and a tolerably 
simple literary explanation can be phrased. We shall see how this 
theorem emerges as a special case of general theorems concerning 
transformations. 

The most general conditions of equilibrium are of the type first 
encountered in chapter ii, equation (1), which we may rewrite in 
matrix form with suppressed subscripts as 

f(x; a) - 0. (16) 


If we subject the independent variables to the non-singular 
transformation 

x = T(x), (17) 

whose Jacobian 

- = rr-H 

dx 11 3 J 

never vanishes, then our equilibrium conditions become 

f[T{x) ; a] = f(x; a) = 0. (18) 

T W. Leontief, “Composite Commodities and the Problem of Index Numbers/’ 
Econometrica , vol. IV (1936). 
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If the relationship between the equilibrium values of our un- 
knowns and the parameters was given by 


* = g(a), 

(19) 

the new relationship will be 


x = g(a) 9 

(20) 

which will be equivalent to 


3 

il 

3 

(21) 

Most importantly, it can be easily verified that 


dx dx dx 

da dx da * 

(22) 

We may turn directly to the general extremum problem where 
some quantity is to be at a relative maximum or minimum with 
respect to n independent variables, which are themselves under 
m constraints. 8 In matrix notation we have 

^0 

II II 

(23) 

At a regular maximum it is necessary that the matrix 


C U <?.] 

(24) 

be of rank (n — m), and that 


h'Hxxh 

(25) 

be negative definite subject to 

h'G x = 0. 

where 


m 

Hxx — C fxi*i + 2 

1 

(26) 

where the X’s are Lagrangean multipliers. 

After transformation we have 


- [I]' V4A 

(27) 


* See Mathematical Appendix A, Section III. 
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where prime on a matrix [a.;J indicates its “transpose” 

Since the first matrix on the right-hand side is non-singular, the 
first order rank conditions are invariant under transformation. It 
is also easy to show that 

h'H^h = h'Hxxh, (28) 

where 

h'Gx = h'G x — 0, 


h 


_ ~ dx~ 
|_ dx _ 


h 


so that the second order conditions are invariant under trans- 
formation. 

Alternatively this might have been proved by showing the in- 
variance of the relevant bordered determinants in consequence of 
the relation : 


(29) 


It will be noted that the Lagrangean multipliers (Xi, • • • , X W1 ) 
are invariant under transformations of independent variables. 
There are economic as well as mathematical explanations of this. 
If we insert parameters (ai, • • • , a m ) in our equations above, so that 
our constraints become 


"ffs Gx 



O 

» 

~ II XX 

Gxl 

dx 

dx 

0 

0 

1 


1 

O 

GJ 

0 . 

0 



G ,s (x) — <Xh — 0 , 

then one can easily verify by direct differentiation that 


'ill 
_ da k J, 


other as constant 


- C- **]• 


(30) 


(31) 


Earlier we have derived special cases of this in connection with the 
marginal utility of income (chapter v, p. 100) and marginal cost 
(chapter iv, p. 65). 

The above relationship can be derived heuristically by differ- 
entiating II with respect to a k , holding all other variables constant. 
As with other operations involving Lagrangean multipliers, this is 
an analytic “trick” whose justification lies in its equivalence with 
expressions which can be rigorously derived by more roundabout 
methods. 
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Explicit attention is called to the fact that transformation of 
the independent variables of an extremum problem subjects the 
matrix of the relevant quadratic forms to a cogredient congruent 
transformation which leaves the definiteness of the quadratic form 
unchanged, and also leaves the signs of principal minors of a given 
order intact. This is in consequence of the equivalence of our 
equilibrium equations to an extremum condition, and need not be 
true for other types of equilibrium. 

Thus, when Hicks 9 sets down as a condition of perfect stability 
under general market exchange the requirement that all the 
principal minors of a matrix 

_ rf*n 

bpi\ 

be positive, we have as yet no way of knowing whether or not this 
is independent of the particular coordinate system chosen. The 
same may be said of his concept of “imperfect stability.” The 
reason for the ambiguity lies in the inapplicability of the proofs 
given above to asymmetrical matrices. 10 

This section has dealt with the invariance of an extremum posi- 
tion under transformation of independent variables. We turn now 
to the problem of transformation of the dependent variable. 

Transformation of Dependent Variable 

It is by now well known that the equations of demand are 
independent of the particular choice of utility index; 11 i.e., all of 
their properties are invariant under a general transformation 

U = F(<p), (32) 

where the latter is a one to one, always increasing function. For 
simplicity we may assume that it is twice differentiable. 

In the most general case where f(x) is to be made a maximum 
subject to one or more constraints on the x's given by 

G(x) = 0, (33) 

9 J. R. Hicks, Value and Capital (London, 1939), chap, v and Mathematical Ap- 
pendix, pp. 315-317. 

10 This problem is discussed again later in this chapter. 

11 See chap, v, pp. 12 and 19. 
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the equilibrium conditions are completely expressed by the condi- 
tion that \JxG x ~\ be of certain rank, and that the quadratic form 
h'H xx h be of certain definiteness for values of h satisfying 

h'G x = 0. 12 (34) 

Now if we subject z and the G’s to the transformation 

F = F(z) and E = E{G), (35) 

and ask that the new variable F be maximized subject to the new 
constraints, our conditions involve the rank of [_F X E X ~\ and the 
definiteness of h'R xx h, subject to 

h'E x = 0, (36) 

where * 

R xx = iFa + £ mkEtft. (37) 

1 

By actual differentiation we easily find the relationships between 
the new and old matrices, namely 

E ,( G) } (38) 

Since the last of these matrices is a diagonal matrix which is to be 
non-singular, the first order rank properties are clearly invariant 
under this transformation, as is the actual solution in x itself. 

We may similarly verify by differentiation that 

h'Rxxh = h'tF'fa + £ m*£/(G*)GV]/* 

1 

+ h'lF'-’fifi + £ m^'GfGfth. (39) 

i 

From the first order rank conditions, there must be linear de- 
pendence between the f x and G x columns of precisely the sort to 
make the second part of the right-hand side of the above equation 
vanish for admissible h' s. From the monotonic property of the 
functions F and E it follows that [F'Ek'~\ are non-vanishing; hence 
the transformed quadratic form retains its definiteness. 

12 See Mathematical Appendix A, Section V, 
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This completes a sketchy proof of the invariance of our equi- 
librium conditions. This whole problem is of interest primarily in 
connection with utility and consumer’s choice theory, but our 
theorem applies just as well to the maximization of profit. If we 
maximize the square of profit, or the log of profit, or 90 per cent of 
profit, or that monotonic function of profit which the Treasury 
tells us we can retain as net income, there will be no change in our 
optimum price and output policy. In connection with the produc- 
tion function the combining of the factors of production so as to 
produce a given output most cheaply could be done as well if we 
worked with some indicator of output rather than with output 
itself; alternatively, we might work with the iso-product contours 
alone, taken without regard to their numbering. However, it is 
important to realize that the variables, profit and output, are 
measurable under ideal conditions ; consequently it is artificial and 
unnecessary to replace them by a transformed variable. 13 

Explicit evaluation of the new Lagrangian multipliers shows 
that they have been changed by the transformation as follows: 


_ 

mk ~ E k '{G k ) ' 


(40) 


The significance of this for the marginal utility of income appears 
in the discussion in chapter vii. 


Transformation of Prices 

Thus far I have considered transformations of the unknown 
variables, leaving the parameters, a , unchanged. We are able to 
evaluate changes of the new variables with respect to a by equation 

13 Can those who have a fondness for cardinal utility find support for their position 
by analogy to other disciplines, such as physics? In my opinion, no. Certain physical 
concepts, such as temperature, were for a long time treated by the physicists as ordinal 
magnitudes. Their behavior could be described by any one of many different indicators, 
whose relationship to each other was not linear. Finally it was deemed convenient to 
give conventional, privileged position to one such index, namely that based upon the 
properties of a “perfect gas.” Nor is the use of the word “force” in physics books a good 
precedent for utility analysis; much of physics can dispense with the term, and in any 
case the things for which it stands, such as rate of change of momentum, or derivative 
of a potential function, can be given strict observational values, which are not invariant 
under monotonic transformation. Perhaps a better precedent for economics is that of 
statistics, where likelihood or its logarithm is used indifferently depending upon nu- 
merical convenience, and without metaphysical connotations. 
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(22). However, in the new coordinate system we are no longer 
able to state definite theorems regarding the signs of these changes. 
In the old system we could definitely state that 


dvi 

dWi 


< 0 , 


(41) 


but our new coefficient ( dvi/dWi ) is a linear combination of terms 
of the form (dvi/dwi), the sign of which can be anything. 

And yet, since there was nothing privileged about the first 
coordinate system, it is manifest that there must be somewhere in 
the new coordinate system theorems no less definite than in the 
old. As we shall see, this “hunch” is correct. There are such 
theorems,^ and they are to be derived by considering the changes 
of our new variables with respect to specified composite changes in 
our parameters. That is, at the same time that we subject our 
equilibrium unknowns to a transformation, we subject our param- 
eters to a transformation. The latter transformation is not the 
same as the former, but, as will be shown presently, the transforma- 
tion of parameters is related in a definite manner to the trans- 
formation of quantities. 

For the present purpose, it is not necessary to examine the 
problem in full generality. We may consider the important case 
of a firm maximizing profit as in the first section of this chapter. 
The case of an extremum under constraint, such as appears in pro- 
duction and utility theory, can be developed in a straightforward 
manner by the reader. 

Let profits be written in the form 


ir(v ; w) = R(v h •••,»„) — £ (42) 

i 

where the w’s are parameters taken as given by the firm, and the 
v’s are productive inputs. The v’s and w’s are just special cases of 
x’s and p’s, having specific reference to inputs. 

If now we define new commodities (i>i, ■■■, v n ), we ought to be 
able to find definite relations between their changes and changes 
in their prices, just as we have been able to do in the old coordinate 
system. But what are we to mean by the prices of the new com- 
modities, (tz>i, • • • , i v n ) . Only one answer is possible, as must be 
obvious to anyone who has worked with the price of a market 
basket of goods. 
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The price of the latter is reckoned as a weighted sum of the 
individual prices, where the weights are chosen so that the total 
value of the new commodity, equal to wv, is the same as the value 
of the component parts. This suggests a general rule to be fol- 
lowed. Throughout all transformations, value magnitudes are to 
be conserved ; i.e., 

n n 

E wm = E ®kh (43) 

1 1 

regardless of the transformation. For this is a value magnitude 
(dollars, pounds, etc.), which is not arbitrary once prices and 
quantities in any coordinate system are given. On the other 
hand, prices and quantities involve the particular reference system 
being used, and are thus arbitrary. 

It is a familiar mathematical problem to subject two sets of 
variables to transformations, now assumed to be linear, so as to 
conserve their inner products as in equation (43 ). 14 Variables 
related in this way are called contragredient variables , 15 

If we subject the quantities v to the non-singular linear trans- 
formation 

v — cv, v — c~ x v, (44) 

then 

w'v = w'cv. (45) 

If, and only if, 

w' = w'c -1 , or id = c'w, (46) 

will (43) be realized. Thus, the related transformations of prices 
and quantities are 

v — cv, v — c~ l v, (47) 

W = w — c'w, 

where the transposition and inversion of the matrices is to be noted, 
along with the fact that the relationship between the two trans- 
formations is reflexive. 

14 This problem arises in higher geometry, in many branches of mathematics, and 
in multiple regression analysis where predicted dependent variables are to be independent 
of transformations of the predicting variables. As the latter undergo a linear trans- 
formation, the regression coefficients must be adjusted accordingly so as to leave the 
predictions unchanged. 

15 M. Bocher, Introduction to Higher Algebra , p. 108. 
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After transformation profits become 

ft 

ir(v ; w) = x(y ; *) = R(y) — £ (48) 

i 

In form this is the same as the original; consequently, it follows 
from the maximization conditions that everything that could be 
said about ( dv/dw ) can be said about (dvfdw). Thus, 

% - (*•■)- <«) 

is symmetrical and negative definite. 

This might also have been deduced from the fact that 

dv_ _ dfidv_dw ,_ n . 

div — dvdwdtD ' ' 


Substituting into this the equations of (47), we find 

dv dv 

dw~ C dw C 


(51) 


It is a classical mathematical fact that a matrix’s properties of 
negative definiteness and symmetry are preserved after a con- 
gruent transformation of the above type. 16 

There are many equilibrium systems encountered in economics 
which do not arise from an extremum problem and which cannot 
be converted into this form. The various simplified versions of 
the Keynesian system provide but a first example of what is after 
all the general case. A second example is that of the general equi- 
librium equations of Walras. An intermediate case is a third ex- 
ample in which we assume consistent preference behavior, infini- 
tesimal indifference loci at every point subject to certain generalized 
convexities, but without the assumption that “integrability” con- 
ditions are satisfied so that the “local” indifference planar elements 
can be “joined” together to form a family of indifference loci. 
Thus, our preference field may be characterized by a differential 

16 Mathematical Appendix A, p. 14. The first proof above can be used to derive a 
proof for this classical theorem. Also, if we set c — [a], it is easy to show that the 
inverse of a definite matrix, a, is itself definite. This is related to the “inverse’ * profit 
function or “price potential” discussed in chapter iii, p. 46. The latter in turn seems 
related to the so-called Legendre transformation. See A. Wintner, Analytical Founda- 
tions of Celestial Mechanics (Oxford: 1941), chap. i. 
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expression, or Pfaffian, for which no integrating factor can be 
found. Only in the case of two goods can we always find an 
integrating factor which will lead to an “exact differential’’ which 
can be integrated. 17 

The first of the above three examples is of no interest in this 
connection since we cannot pair a set of “price” variables as 
“conjugate” variables to “quantities.” 

Turning to the second example of general equilibrium, I 
should like to point out a grave difficulty with the Hicksian concept 
of stability of general exchange and production. In later chapters 
it is argued that stability is an essentially dynamic concept, and 
that the formal statical treatment of Hicks leads to conditions 
which are neither necessary nor sufficient from a dynamical point 
of view. Here it will be argued that the Hicks stability conditions 
are unsatisfactory from a purely statical point of view. 

This may be illustrated by the somewhat simplified system of 
chapter v of Value and Capital, whose equilibrium is given by a 
set of equations of the form 

x k (pi, ■ ■ ■ , p n ) - 0. (k = 1, • • •, n) (52) 

To be perfectly stable the Jacobian, [_dx/dp~], must have all its 
principal minors oscillate in sign. A matrix with this property I 
shall call Hicksian. To be imperfectly stable, we only impose the 
weaker condition that the (n — 2) and ( n — 1) order minors be of 
the opposite sign. 18 

Since no particular definition of commodities or frame of refer- 
ence is fundamental, if the Hicks conditions are fundamental, they 
must be invariant under transformations of price and quantity. 
A single example suffices to show that a matrix a which is Hicksian 
becomes under transformation c~ 1 [dx/dp']c~ 1 ' , and this need not be 
Hicksian, nor even imperfectly stable. 

17 The problem of integrability goes back to the “classical” discussions of Irving 
Fisher and Pareto. In their “A Reconsideration of the Theory of Value,” Parts I 
and II, Economica , XIV (1934), 52-76, 196-219, Hicks and Alien touch upon it, but 
not in an altogether satisfactory fashion. The most illuminating treatment is that of 
Georgescue-Roegen, “The Pure Theory of Consumer's Behavior,” Quarterly Journal of 
Economics , L (1935-36), 545-593. See also the discussion of chap, v, p. 8 above, and 
P. Samuelson, “A Note on the Pure Theory of Consumer’s Behavior,” Economica , V 
(1938), 61-71, 353-354, for some unsolved problems in this connection. 

18 Value and Capital , pp. 67, 315. 
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Thus for 


we get 


dx 

dp 


dx 

dp 


-1 10 ] 

0 -lj 

u 1 ] 

I* .96 10 ] 

L 0 1.04 J 


( 53 ) 


(54) 


which is neither perfectly stable nor imperfectly stable. 

Of course, if [dx/dp~] had been symmetrical and definite (as 
when income effects can be neglected), this could not have hap- 
pened; but then our system would have been convertible into a 
maximum problem. 

I have shown elsewhere that convexity in non-integrable cases 
implies that the “symmetrical part” 19 of certain matrices be 
definite. This follows in consequence of the relation 


E AxApi =i 0, 


(55) 


where the p’s and the x’s may or may not be under constraints. 
By going to the limit, we find that [jdx/dp + dx/dp'~]/2 and 
[_dp/dx + dpjdx'~\! 2 must be non-positive. 

It is not hard to show that this property, which we shall call 
quasi-definiteness, is invariant under redefinition of commodities, 
or under a general non-singular congruent transformation. Thus, 
let [a, + a'~\j 2 be negative definite. Then the quadratic form 


h' ah = h 




h + 0 


(56) 


is negative definite. Conversely, if h'ah Is negative definite, then 
a is negative quasi-definite. 

With the aid of a non-singular matrix b we subject a to a con- 
gruent transformation so that 


a = b'ab. 


(57) 


19 Every matrix a can be decomposed into a symmetrical and an asymmetrical 
part. Thus, 

a 4 * a — a 1 

a^—— + 


2 


2 
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Then 

h'ati = [ h'b'lplbK} = h'ah, (58) 

and the extreme right-hand member is always negative, as must 
be the left-hand side. Thus, [cL + a'J/2 is negative definite, and 
a is negative quasi- definite. 

The above analysis may be of interest even where there is in- 
tegrability. If income effects cannot be neglected, the Jacobians 
of general exchange equilibrium need not be symmetrical. How- 
ever, if the income effects are symmetrical and definite, or if their 
symmetrical parts are definite, or if their symmetrical parts are not 
so lacking in definiteness as to outweigh the definiteness of the 
substitution effect, then the Jacobians will be quasi-defenite. Not 
only is this property preserved under transformation, but it is 
possible to establish the Theorem: If a is quasi-definite, it is 
necessarily Hicksian, but not conversely. 

This important theorem can be proved in a variety of ways. 
One is to write each principal minor of a in a Taylor’s expansion 
around [a + a'J/2 so that the skew elements (o»,- — oy,-)/ 2 appear 
in a power series. Then it can be shown that all of these elements 
enter in even powers so that the complete a matrix is “more 
Hicksian” than its symmetrical part. This confirms my previous 
remark that asymmetry, per se, makes for greater rather than less 
stability. The real significance of quasi-definiteness can only be- 
come apparent from the dynamical discussions of stability in 
later chapters. 

Demand for a Group of Commodities 

Value and Capital will take its place in history along with the 
classic works of Cournot, Walras, Pareto, and Marshall. Like the 
latter, Hicks has succeeded in keeping formidable mathematical 
analysis below the surface of things and locked up in appendices, 
thereby securing for his work a much wider audience than would 
otherwise be possible. This tour de force was made possible in 
considerable degree by the repeated use of the already mentioned 
theorem relating to the demand for a group of commodities when 
their prices all change in the same proportion. We are now in a 
position to derive this theorem rigorously in a more general form. 
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When a group of prices, say (pi, ■ • • , p r ), all move together, 
it is natural to define a new commodity, x h by the relation 

xi = piXi + P 2 X 2 + • • ■ + prX r . (59) 

Strictly speaking, we are not thereby enabled to replace n com- 
modities by (n — r + 1) commodities. We are merely changing 
our frame of reference, and so we must lose no dimensions or degrees 
of freedom in the process. Our complete transformation of quan- 
tities can be given by 

( 60 ) 

Here we h^ve simply defined all commodities but one to be iden- 
tically the same as before ; 20 the first commodity has been replaced 
by the new composite commodity. The contravariant price vari- 
able must then satisfy the equation 

1 0 • • • 0 

~pr 

pi 

P, IP ( 61 ) 

Pi 
0 


0 1 j 

so as to leave JL px = X) P%- Even though ( n — 1) commodities 
are the same as before, their prices must have changed. This 
will not seem strange if it is remembered that a price is not a 
property of a good itself independently of the frame of reference 
involved. In fact, it is quite possible for either prices or quantities 
to become negative, although X) P x will conserve its original sign. 

Now if we change prices (pi, • • • , p T ) in the same proportion, X, 
holding constant ( pr+u pn), it can be shown by explicit differ- 

20 This is only one of an infinite number of possible transformations which will serve 
thf present purpose. 



pi 

P2 •• 

■ • p r 0 • 

0 

0 


I 
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entiation, dp/d\, that in the new price-quantity coordinates only 
the pi is changed with all other p’s remaining constant. Conse- 
quently, if this price changes, and if income is changed in such a 
way as to keep utility constant, then 


Also 


dx 1 
dp i 


£11 < 0 . 


KuKiz — -Ki 2 2 > 0, etc., 


(62) 

(63) 


so that all of the properties of the demand functions given in the 
previous chapter are satisfied. 

Thus, the demand or a group whose relative prices are unchanged 
satisfies the same inequalities as in the case of a single good. 

This is the fundamental Hicksian theorem. 21 It can easily be 
broadened so that we may work with s( S n ) composite goods. 
Their substitution terms form a negative definite quadratic form. 

This is a very useful theorem. It states that any group of 
commodities whose relative prices remain unchanged can be lumped 
together into a single commodity, and that indifference curves in 
fewer dimensions can be constructed which will have all the usual 
properties of indifference curves. Thus, all goods but one can be 
lumped into one commodity w r hich Hicks calls “money,” and then 
concave indifference curves can be drawn between the good in 
question and “money.” 22 

It would seem that this involves rather a strained use of the 
term “money,” and one almost certain to lead to confusion. 23 
Very properly when Hicks later comes to discuss monetary matters, 
he rejects this earlier notion. 

21 Proved in his Mathematical Appendix, pp. 311-312, as a consequence of his sixth 
rule relating to substitution terms. All six of these rules are contained in the statement 
h r \_Xif]h is non-positive definite quadratic form of rank (n — 1), which vanishes for 
values of h proportional to prices, this being an immediate consequence of the primary 
conditions of equilibrium and the secondary conditions guaranteeing an extremum 
under constraint. 

22 Value and Capital , p. 33, passim , diagram on p. 39. Also see A. G. Hart, “Peculi- 
arities of Indifference Maps Involving Money,” Review of Economic Studies , VIII (1941), 
126-128. 

23 Such an instance is provided by a writer who interprets Hicks literally and con- 
fusedly attempts to throw' light on the “inflationary gap” by means of an indifference 
diagram in which money is taken as a catchall for all but one commodity. M. W. Reder, 
“Welfare Economics and Rationing,” Quarterly Journal of Economics, voL LVII (194£). 
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The General Problem of Composite or 
Aggregate Commodities 

Any economic system when viewed carefully will be found to 
consist of an almost uncountably large number of variables. It is 
almost a necessity if advances in analysis are to be made to simplify 
matters artificially so as to reduce the number of variables which 
are to be handled. This is done in a number of different ways. 
Some authors retreat to a one or two commodity world in order to 
derive more precise results ; the penalty for this lies in the difficulty 
in establishing the relation between the simplified construct and 
complex reality. This, however, is at least an honest procedure. 
Other writers wish to have their cake and eat it too : to work with 
only a fev^ variables and at the same time retain an air of realism 
and versimilitude. 

Here too a variety of artful dodges are open to the investigator. 
(1) He may hold other things equal. (2) He may concentrate on a 
representative firm or family, each of whom is doing about what 
all are doing. (3) Or he may work with certain aggregate, com- 
posite magnitudes, such as bales of output, “socially necessary” 
labor, “wage units,” cost of living, real national output, etc. 

It is the third device which is of interest here. There is 
nothing intrinsically reprehensible in working with such aggregate 
concepts. On the contrary, abstraction from complexity is a 
necessary thought process. And in any case the most general 
equilibrium set up must necessarily stop short of the full total of 
all possible economic variables. But it is important to realize the 
limitations of these aggregates and to analyze the nature of their 
construction. 

Broadly speaking, we replace a number of variables by a single 
variable under two diametrically opposite conditions. In the first, 
the variables have each the same effect (except possibly for scale 
differences which disappear upon redefinition) upon all of the eco- 
nomic functions under discussion. In this case, they can be 
summed and treated as one. Thus, we might define as the same 
commodities all goods which have exactly the same influence on 
consumption preference fields and production functions, which are 
so to speak infinitely substitutable. In practice rigid adherence to 
perfect substitutability might lead to the unmanageable result that 
no two things are quite the same. Consequently, goods which do 
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not significantly (for the purpose at hand) differ are treated as 
identical. 

Actually the above case of substitutability is but one example 
of the more general mathematical theorem whereby the identical 
vanishing of a Jacobian, or of all of its minors of a given order, 
implies that there exists functional relationships between sets of 
the variables, so that many may be dropped out of the picture 
completely. It is clear from the implicit function theory that such 
simplifications are possible only in the case where the original 
system was indeterminate . 24 This may or may not be a matter of 
concern. What if the variables which are extraneous cannot be 
given determinate values by the system of equations which define 
equilibrium? The indeterminate variables may be a matter of 
indifference to the economist. Thus, take any determinate system 
of economic goods. Let each unit of one type of good be marked 
with an invisible serial number, and now let us ask how many odd 
numbered units will be bought by a given consumer. Clearly the 
answer is indeterminate, but also of no possible interest. If con- 
sumers did have preferences as between different serial numbers, 
then the indifference schedules would be affected by this fact, and 
additional equations would be available to determine the final allo- 
cation. And notice that even in the case of complete indifference 
the final allocation is not really indeterminate on any particular 
occasion. It is just that the determining factors, which are taken 
as due to “chance” by the economist, would be of a different 
character. 

Unlike the first condition under which variables are lumped 
together because they are infinitely substitutable, at the other ex- 
treme we combine variables which bear an invariant, “perfectly 
joint” relationship to each other. A notable example is provided 
by the classical “dose” of labor and capital applied to land. Here, 
too, the economist customarily relaxes the rigid requirement of 
perfect collinearity in favor of an approximate condition. Thus, 
the most primitive justification for a price level construct is pro- 
vided by the indisputable fact that prices do generally rise and 
fall in about the same proportions. 

24 This must be qualified. Our original equilibrium equations may be imbedded 
in a still wider set of equations so that the total is determinate, but the original subset 
taken by itself is not. 
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Geometrically this differs from the first case in which indiffer- 
ence curves or iso-quants are straight lines. Here the contours are 
broken lines meeting at right angles in such a way that the goods 
or factors will by obvious choice be combined together in the given 
proportions almost regardless of price ratios. Unlike the Hicksian 
bundle of goods the physical composition is the same regardless of 
relative prices. 

Of all composite magnitudes perhaps the most interesting to 
the theorist is that of an index of the cost of living, or Of an index 
of consumption output. Such an index is designed to meet certain 
special requirements and need not be satisfactory for other pur- 
poses. In particular, it need not represent the desideratum from 
the standpoint of the questions which Jevons and other pioneers 
in the use of index numbers sought to answer. Yet the theory of 
these indices is of some interest for its own sake and because in the 
course of investigation of these index numbers economists inad- 
vertently stumbled upon certain ordinal relationships which are 
basic to welfare economics and to consistent consumer’s behavior. 

The Economic Theory of Index Numbers 

Economists such as Jevons, Edgeworth, Marshall, Allyn Young, 
Warren Persons, Irving Fisher, Edwin Frickey, and others have 
made contributions to what may be called the statistical theory of 
index numbers. But what has come to be called the economic 
theory of index numbers is concerned with quite distinct matters. 
To this theory many economists have made contributions. A 
partial list would include the 'names of Wicksell, Konus, Bortkie- 
wicz, Bowley, Haberler, Pigou, Keynes, Staehle, Leontief, Allen, 
Lemer, Frisch, and Wald . 25 

With the exception of Leontief all of these writers are unduly 
preoccupied with the problem of the price index number, instead of 
being occupied with the more fundamental question to which it is 
only a partial, and somewhat arbitrary, answer. The fundamental 
problem upon which all the analysis rests is that of determining merely 
from price and quantity data which of two situations is higher up on 

!5 Reference may be made to the survey article by R. Frisch, “Annual Survey of 
General Economic Theory; The Problems of Index Numbers,” Econometrica, IV (1936), 
1-38; also to the article by W. Leontief in the same issue. The early volumes of the 
Review of Economic Studies may be consulted for other discussions. 
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an individual’s preference scale. This problem admits of a partial 
answer if certain rigid assumptions are fulfilled. 

It is assumed that there is only one individual, whose tastes do 
not change in the period under consideration ; or if there are more 
than one, that their tastes are identical. It is not necessary to 
assume that the regime of commodities is the same in the two 
situations provided we follow the convention of setting at infinity 
the price of any good which is unavailable. At whatever the 
prices named, the individual is presumed to buy as many or as 
few of the goods as he wishes. This rules out such phenomena as 
rationing or monopsony. Ordinarily, a comparison is made be- 
tween two situations which differ in time, but we might just as well 
compare two situations different in any respect, as in tile case of 
comparisons in the cost of living between two regions. 

Our fundamental data consist of prices and commodities, in- 
cluding productive services which can be treated as negative com- 
modities, in the two situations, represented respectively by (P®, X a ) 
and ( P b , X b ), where these are shorthand notations for the prices 
and quantities of n goods. Of course, if we know completely the 
individual’s whole preference field, we may simply insert into it 
the two amounts of commodities and read off which is the better, 
or whether they are indifferent. Since we do not know the field, 
our problem is to go as far as we can on what we do know. 

If for the moment we restrict ourselves to the quantity data, 
we ordinarily cannot say which of the two situations is the better. 
However, in the exceptional circumstance in which one of the 
situations contains more of some of the goods than the other, and 
not less of any of the other goods, then it is clear which is the better. 
Taking situation A as our reference point, and thinking of the 
simpler two-dimensional case, we can definitely divide up the whole 
of the commodity space into four regions with respect to X a by 
letting two perpendicular horizontal and vertical lines (not shown 
in the figure) intersect at this point. Treating this as the origin, 
we can definitely classify all points in the northeast quadrant as 
better than X a , and all points in the southwest quadrant as defi- 
nitely worse. We are left then with ignorance concerning the in- 
terior points of the other two quadrants. In these more of one 
commodity has been acquired at the expense of less of another, and 
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until we have further information concerning the preference field 
this is as far as we can go. 

But it should be possible to utilize our price information. If 
the individual is supposed to be maximizing his ordinal preference 
field subject to the given budget relation, we may be sure that the 
point X a is better (or not worse) than any point between the budget 

REGION OF IGNORANCE IN ANALYSIS 
OF INDEX NUMBERS 



Figure 1 


locus and the axes. For all such points were available to the in- 
dividual in the initial situation, and he nevertheless preferred to 
select X°. It follows that for all such points 

U(X) m U{X“). (67) 

Thus, we have narrowed down the region of our ignorance, but not 
completely. 

It will be noted that the two situations have not been treated 
symmetrically, A always being the reference point. It might well 
happen that X a lay inside the budget line of the point X b . Assum- 
ing for the moment that knowledge of the complete preference field 
were available, for what values of X b would this be possible? Ap- 
plying the requirement that X a must cost less than X b at the prices, 
P b , the boundary of this region will be given by the equality 


E P b X a = E P h X b . 


( 68 ) 
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Geometrically, this locus is the result of swinging the budget line 
through the point X a , and determining for each position the point 
where it reaches the highest indifference curve, to which it is 
clearly tangent. If we connect all such points, we have a familiar 
offer curve. All points above this offer curve, are definitely better 
than X a , in the same sense that all points under the budget line 
of X a are definitely inferior to X a . (It is to be emphasized that a 
knowledge of two points does not give us this locus in the way that 
it gives all the previous boundaries. Nevertheless, given two such 
points, it would always be possible to decide just where X b was 
with respect to X°.) 

We have now narrowed our ignorance down still further. Ac- 
tually this is as far as we can go on the basis of the giyen data. 
Note that the old boundaries of our ignorance derived from the 
utilization of quantity data alone have been made obsolete by the 
additional light furnished by the auxiliary price information. 

We have gone a long way, but there is still left a definite area 
of darkness — the space between the two heavy lines. I should 
like to state as strongly as possible that this final indeterminacy is 
intrinsic and inherent. No amount of ingenuity can remove it, 
grounded as it is in the fundamental convexity properties of the 
indifference field, or more accurately in the consistency behavior of 
the individual. It is important to prove this rigorously, for pe- 
culiarly in the literature of index numbers is an attempt made to 
search for limits within which the truth must lie without at the 
same time investigating whether or not these are the best possible 
limits. Moreover, the limits themselves are sometimes derived 
under special approximations, such as the neglect of “squares of 
small quantities,” etc. 

To see that these are indeed the best possible limits under the 
circumstances, let us suppose that someone proposes narrower 
limits. Since our preference field is arbitrary except possibly for 
certain curvature properties, we can draw the true indifference 
curve through X a so as to contradict any more definite result. If 
the proposer claims that a given point in the area of darkness is 
worse than X a , we can pass the indifference curve below that point 
but above the correct lower boundary in such a way as to make this 
statement wrong. Similarly, the opposite statement can be shown 
not to hold universally. (It is of course understood that “we” do 
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not actually change the indifference field of the economic unit under 
observation; but “we” can find a consistent field for which the 
given result holds.) Where the offer curves bend back, we are able 
from quantity considerations alone to extend our boundaries along 
the dotted lines indicated in the figure. 

For well-behaved indifference fields with the proper continuous 
derivatives, the offer curve which gives the upper boundary of the 
area of darkness will be tangential to the indifference curve going 
through X a and to the lower budget boundary through X a . Even 
if there is a corner in the indifference curve at X a , there must be 
generalized tangency in the sense of touching from above without 
crossing. One might be tempted to say that the area of inde- 
terminacy narrows down to a point in the vicinity of X a , but 
nothing is gained thereby. Nevertheless, the mathematician is 
tempted to consider Taylor’s expansions in the neighborhood of 
the X a point and to neglect terms of higher order. Perhaps there 
is even some empirical statistical usefulness in the handling of 
budgetary data to be found in this practice, which has been associ- 
ated with the names of Bowley and Wald. 

But from a fundamental viewpoint there is no getting around 
the fact that for any finite move, however small, there remains a 
region of ignorance; this region may narrow down as the size of 
the movement decreases, but it never vanishes, except for the 
trivial case of a vanishing movement . 26 Thus, there is nothing to 
be gained for the present purpose from following the practice of 
Divisia 27 in working with differentials or infinitesimals. The fact 
that these avoid difficulties connected with the time reversal and 
factoral reversal is not an indication of their superiority, but of the 
fact that they sidestep the intrinsic difficulties of the subject 
matter. 

By the examination of value sums we can sometimes, but not 
always, state definitely whether one situation is worse than another. 
But we can never by these means state that two situations are 
equally desirable. In fact, as will become evident from later dis- 

16 It may be worth mentioning that even a knowledge of the curvature of the in- 
difference curve at X a will not enable us to narrow down our region of ignorance. It 
will enable us to assert with confidence that the indifference curve will approach arbi- 
trarily near to the osculating circle, but the departures for any finite small movement 
may be of either sign and of any magnitude. 

27 F. Divisia, Economique Rationnelle (Paris: 1928). 
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cussion, knowledge of prices and quantities on a finite number of 
points will not permit us to determine the equality of two points; 
but in the limit as the number of points becomes infinite we can in 
fortunate circumstances determine points of indifference. 

This result may be approached by a consideration of the addi- 
tional information which a third point will yield. We now have 
three pairs of points, and it may turn out that any two of them 
when put to the test of our computed value sums gives a definite 
answer. In that case each pair can be considered by itself without 
regard to the third point. Of course, from the discussion of chapter 
v it will be clear that a consistent ordinal preference field can never 
give contradictory testimony whereby point A is better than B, 
which is better than C, and at the same time the lattes is better 
than A . For the relations of ordinal utility are transitive. 

But the relations of better or worse as revealed by the value 
sums are not transitive ; that is why I have elsewhere proposed a new 
notation to represent “revealed” preference in this special sense. 
Thus, if our value sums give a definite result so that 

£ P'X h S £ P»X«, (69) 

this fact may be represented by the symbol 

X 6 © X*. (70) 

Since 

X 6 ©X‘ implies U(X b ) < U(X«), (71) 

and this in turn implies 

Z7(X“) < U(X b ), (72) 

then to avoid a contradiction in logic, we must be able to state 
the theorem 

X b ©X“ implies X*@X\ 

But this is not the same thing as the meaningless assertion that 
X b ©X« implies X‘@X 5 , (73) 

or the incorrect assertion that 

X 6 © X* and X« © X 6 implies X' © X“. (74) 

The most that can be stated under the above hypothesis is that 

X« © X e : (75) 
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This is much weaker than transitivity. An additional indication 
that the algebra of revealed preference is quite distinct from that 
of cardinal or ordinal numbers is the fact that equality is not 
defined; consequently, two points cannot be placed in one of the 
three categories: A worse than B, or B worse than A, or the two 
equally good. All we can say is: either A is revealed to be worse 
than B, or B is revealed to be worse than A , or there is no indica- 
tion one way or another. These are mutually exclusive categories 
only if a consistent preference field is postulated. 

It is precisely because of this lack of transitivity that knowledge 
of a third point may add knowledge to a comparison between two 
given points. Our value sums may give no indication with respect 
to points X a and X c taken by themselves, but an intermediate point 
X b may serve to indicate their true ordinal relationship to each 
other. We are in a position now to indicate just how much points 
X b and X c may narrow down our ignorance in relationship to X a . 

If the two additional points both lie in our old area of darkness, 
we are no better off than before. But if X b lies in the upper region 
of certainty with respect to X a , and X c lies in the upper region of 
certainty with respect to X b , then even though X° lies in the X“’S 
region of uncertainty, it can still be definitely said to be better 
than X a . Therefore, we have narrowed down our area of darkness. 
How much can this area be reduced in the most favorable case? 

From the geometry of the problem it can be shown that X b must 
lie on X a ’s offer curve for the best results. We then proceed to 
draw the offer curve through X b . This will cross the old one and 
thereby narrow our region of ignorance, since every point above 
the new offer curve is definitely better than X a . If now we let X b 
take each and every position on the old offer curve, the process can 
be duplicated as many times as we wish, giving us a one parameter 
family of new offer curves. The lower envelope to this family of 
curves gives us our new upper boundary of uncertainty. 

In the same way, we can secure a best lower boundary by letting 
X b travel along the original budget line, generating through each 
such point a new budget line, or a one parameter family of such 
lines in all. The upper envelope of this family of lines is our new 
lower boundary. Our new boundaries must of necessity lie within 
the old ones, but there is still left an area of darkness. 

If a fourth point is added, we may narrow our ignorance still 
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further; similarly with a fifth point. It is intuitively clear that 
the boundaries will never meet for any finite number of points ; but 
in the limit as the number of points becomes infinite, the upper and 
lower boundaries approach a common limit, which is of course the 
indifference curve through X a . Points along this curve, and only 
such points, can never find their place in a finite chain of points 
which serve to relate them unambiguously to the initial situation. 
Thus, with a single dimensional infinity of points any indifference 
curve may be traced out. With a two-dimensional infinity of 
points, the whole indifference field can be determined. In fact, 
with this much knowledge we may dispense completely with re- 
vealed preference, and instead “integrate” completely our elements 
of slope at each point into a one parameter family of ir’difference 
curves. 

The above discussion is concerned with the narrowing of our 
field of ignorance under the best circumstances. In actual practice 
we shall not necessarily do nearly so well. An instance is provided 
by the frequently met case in which we know a complete budget 
line (expenditure path, income consumption curve, etc.) giving the 
behavior of changes in all of the commodities with changes in 
money income but with unchanged prices. Such a situation is 
provided by observations of the behavior of more or less similar 
people, all confronted with the same prices but differing with 
respect to total expenditure. 

Not only is this of considerable statistical importance, but it is 
of special relevance in connection with the usual economic theory 
of index numbers, where a comparison is made between two price 
situations, rather than between two particular price-commodity 
situations. By applying our previous analysis it will be seen that 
the knowledge of an infinite number of points along two budget 
lines will not be sufficient to determine the whole preference field, 
or even a single indifference curve, or even be sufficient to enable 
us to match points of equivalent satisfaction on the two budget 
lines. If we select a given point on one line and through it draw 
a budget line and an offer curve, then these last two loci will split 
the other budget line into three parts. The lower part will consist 
of points all worse than the initial point, the upper part will consist 
of points all better than that point, while the intermediate part will 
constitute our area of indeterminacy. 
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Of course, the closer together are the two price situations, the 
smaller will be the indeterminacy. Therefore, we can narrow down 
our ignorance by a knowledge of many intermediate budget lines. 
In the limit, as the number of budget lines becomes infinite in such 
a way as to decrease indefinitely the distance between each, we 
shall approach the complete indifference field. 

The value sums considered up until now have all had the dis- 
advantage of being capable of yielding indeterminate results. We 
shall put up with this if need be, but first we must determine 
whether it is not possible to devise an index of quantity, computable 
only from the prices and amounts of all the commodities, which 
will be an unfailing indicator of ordinal utility. Can we not find a 
magic formula which will have these properties when applied to 
any preference field whatsoever, or at least to those of the proper 
convexity? 

The answer is no. As yet no one has devised such a formula, 
and the following mathematical reasoning shows why no one ever 
can, even if we permit more general functions than simple value 
sums. Any such general formula will from the nature of the 
problem be a function of quantities and prices in one or more 
(usually two) situations. Without loss of generality we may con- 
sider the prices and quantities of all but one of the situations as 
fixed, while the one situation represents an arbitrary variable point. 
In terms of its prices and quantities, (P, X ), we must be able to 
construct a single-valued function Q, which is constant so long 
as the X points remain on the same indifference curve of the 
(unknown) preference field. Because this is to be a quantity index, 
it is clear that relative prices only can be important, and so we may 
express all prices in terms of the first good as numeraire. Then Q 
may be written as 

( 76 ) 

For observable equilibrium quantities the ratios of prices are equal 
to marginal rates of substitution ( l R 2 , • ■ • , 1 R n ) , or to ratios of 
marginal utilities ( U 2 / Ui, • • • , U n / U i) . Although the form of the 
relationship will change from preference field to preference field, 
and is in any case unknown to us, there will still nevertheless in 
each specific instance be a functional relationship between these 
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ratios and quantities of goods. Therefore Q will be a uniquely 
determined function of the x’s alone. In fact, since Q is constant 
along an indifference curve, it must itself be one cardinal index of 
utility; its partial derivatives with respect to each commodity 
(computed by adding to the direct effect of each x on Q all indirect 
effects via influence on price changes) can be interpreted as marginal 
utilities, whose ratios are equal to observable price ratios, etc. 

In consequence of this last fact, equation (76) can be written 
in the form 


e ( xi. 


Xn 


§Q /dQ 

dxi / dxz 


dQ_ /dQ_\ 
dxi / dx n ) 


Q S 0. (77) 


This identity is a first order partial differential equation *which the 
preference field must satisfy, and only a limited small subset of all 
preference fields will do so. Consequently we have proved the 
impossibility of finding a magic formula to serve as a quantity indi- 
cator in the general case of a consistent ordinal preference field. 

In actual fact it is customary to restrict the form of a quantity 
index still further, to require that it be a homogeneous function of 
the first order in the component quantities, etc. This restricts the 
range of applicability still further, to preference fields possessing 
expenditure proportionality, etc. 

For any particular 9 it may be a problem of some mathematical 
difficulty to determine the exact restriction on the preference field. 
But where 9 represents the Laspeyre quantity index, thought of as 
a function of the quantity components of the second point, with 
base point fixed, it will be immediately seen that the only possible 
preference field for which it is exact is that of a family of parallel 
straight line indifference curves. Of course, this is almost a re- 
ducto ad absurdum, since convexity is denied, and since all relative 
prices (of goods which are bought) cannot vary. 

The “ideal index number” is one of the most popular index 
numbers. It is the geometric mean between Laspeyre’s and 
Paasche’s index numbers. A. Konus and S. Buscheguennce, 28 as 
well as S. Alexander (in an unpublished Harvard paper), have 
shown that it is exact only for certain hyperbolic indifference 
curves. This theorem may be derived by solving explicitly the 

28 See the reference in H. Schultz, “A Misunderstanding in Index-Number Theory," 
Econometrica , VII (1939), 8. 
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implied differential equation. Similar results can be derived for 
other “approximate” formulae which have been suggested. 

As a result of the analysis of this section, the conclusion may 
be ventured that the important economic content of index number 
theory resides in the fact that it attempts to utilize limited price 
and quantity data to infer ordinal preference comparisons. The 
above formulation seems best designed to reveal this essential con- 
tent, and to show the intrinsic limitations necessarily involved. 
In the next section the same analysis is shown to be useful in 
connection with the more commonly met formulation of the index 
number problem. 

Present Formulations of Index Numbers 

r 

We are now in a position to apply our tools of analysis to the 
more familiar branch of index numbers. First, we consider price 
of living comparisons between two different price situations. The 
price of living index number in going from the situation ( X a ) to 
(. X b ) is familiarly defined as the ratio of the cost of the cheapest 
bundle of goods at the prices of the second situation which will 
yield satisfaction equivalent to that of the initial situation, to the 
cost of the initial bundle at the initial prices. 

We assume as known (say from empirical studies on “identical” 
individuals with varying incomes) the price expansion paths in two 
price situations, (P a ) and (P b ) . These are defined by the following 
sets of parametric equations: 


Xi = hi(pi a , 
Xi = hi(pi b , 


' J Pn a > -0, 

■,pn\r>. 


(i 

a 


1, 

1, 


•, n) 
» ) 


(78) 

(79) 


Consider an initial situation (X a ) on the initial expansion path. 
The indifference locus corresponding to this point will intersect 
the second expansion path in a point called ( X ab ) such that 

U(X ab ) = U(X a ). (80) 

An index number of the price of living is defined as follows: 

£ P b X ab 


Similarly, 


fab 


pa 


ZP a x a 

£ P a X ba 
£ P b X b 


(81) 


(82) 
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where 

U(X ba ) = U(X h ), (83) 

and ( X ba ) lies on the “initial” expansion path. 

These respective index numbers are not rigidly related; in 
general they are not reciprocals. Since we do not know the hypo- 
thetical preference field, (X ab ) is not known, and so our index 
number cannot be computed. 

However, from the considerations indicated above we know 


that all of the points on the expansion path ( b ) fall into 
classes with respect to (X a ) : 

three 

1. 

(X) © (X“) 

z P a X S z P a X a 

(84) 

2. 

m © (x) 

Z P’°X a si Z P b X 

(85) 

3. 

(X) © (X«) 
(X-) © (X) 

z P a X > z P a X a 
z P l X a > Z P b x. 

(86) 


In particular, consider the equality signs in (84) and (85) and we 
get the boundary points defined by the intersection of 


and 


Z P h X = Z P b X a 

Xi = hi(pi b , ••• , fin , I). (« = !,•••, n) 


(87) 

( 88 ) 


Call this point ( u X ab ). 

Consider the intersection of 


and 


IP“I=S P a x* 

Xi = hi(pi b , - • - , pn b , /). (i = 1 , ■ ■■, n) 

Call it ( l X ab ). By definition 


Hence, 

or 


(‘X*) © (X“) © («X<*). 
Z7( z X a6 ) < U(X a ) < U(*X<*), 
U( l X ab ) < U(X ab ) < U( u X ab ). 


(89) 

(90) 

(91) 

(92) 

(93) 


Since along an expansion path the cost of a batch of goods and its 
“utility” are monotonically related, we have 


Z P b l X° h < Z P h X ai < Z P b u X ab . 


(94) 



• Mi; y , 

''fBf 
, ii |ji 
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Hence, dividing by the appropriate factor (if it does not vanish), 
we have 


or 


£ P b l X ab ^ £ P b X ab ^ £ P b “X° 5 
£ P a X a < £ P a X a £ P a X a : 

£ P b l X ah £ P 6 “X« 6 


(95) 


(96) 


£ P°X° ' * " £ P a X a 

These are valid double limits for our index number computable 
under our specified assumptions. More directly from the defini- 
tion of ( u X ab ) , we have 

£ P b u X ai = £ P b X\ (97) 

Hence, oifr double limits are 


£ P“X a 

By symmetry we have 

£ P a l X ba 
£ P b X b 


£ P b X s6 < Inh < £P & X° 


< P a < 


£ P a X a 


ZP a X b 
£ P b X b ’ 


(98) 


(99) 


where ( l X ba ) is symmetrically defined with respect to ( l X ab ). 
Again, these four limits are not rigidly related, no pair being neces- 
sarily reciprocals. Furthermore, it follows from the analysis of 
the earlier sections that these are the best possible double limits 
under our hypothesis, and will never converge to equality for 
convex indifference curves. 

It is clear that the two upper limits are detenninable from price 
and quantity figures alone. The points ( l X ab ) and ( 1 X ba ), however, 
can be computed only if certain subsections of the respective ex- 
pansion paths are empirically known. Dropping this assumption, 
we may ask the question as to what lower limits can be computed 
from price and quantity figures alone. This being a more difficult 
problem, the answer will in general be less satisfactory. 

Recall that ( l X ob ) was the intersection of the budget plane of 
(X°) with the expansion path of the second price situation. Since 
we do not know the expansion path, we cannot compute ( l X ab ). 
But if we restrict ourselves to positive quantities, it is always 
possible to find a point on the budget plane whose cost is less than 
('X® 6 ), and which must a fortiori be a lower limit. Such a point is 
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that point on the budget plane whose cost at the prices ( P b ) 
furnishes a minimum with respect to all points on the budget plane. 
Let us designate this point ( x X ab ). It may be derived by 


minimizing 


F = Z 

1 

(100) 

subject to 


Z PfXi = Z PfXf. 

1 1 

(101) 

This is a constrained minimum problem, and, since the equations 
are linear, the minimum is a restricted or boundary one, derived 
from the fact that no negative quantities are admissible. Ac- 
tually, it is easily verified that the minimum quantity ( b X ab ) is 
(0, 0, • ■ • , x x m ab , • • • , 0, 0) where x m is the good whose price ratio 
between the two situations is highest, i.e., 

Pm Pi b 

Pm* = Pf' 

(102) 

( X X° 6 ) is easily computed since 

, . z P a X a 

Xm ~ p m ° 

(103) 

Hence this lower limit developed by Mr. Lemer is as 

follows: 

Z P b b X ab pj Z P a X a p m b 

Z P a X a p m a Z P a X a p m a 

(104) 

By symmetry, pf/pp is a lower limit for I ba , where 


All 

(105) 

In particular cases other lower limits are possible, 
that our actually observed situation ( X b ) is such that 

It is possible 

m © m. 

(106) 

1 ' e ’’ z P a X b £ z P a X a . 

(107) 


Recalling the definition of ( l X ab ), obviously 

E P a x b s Z P a l x ai 




i6o 

and 
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£ P b X h < £ P b l X°\ 

and so in this case we have a iower limit as follows: 


ZP b x b 


< i ai . 


ZP a X a 

Note that this holds only if (106) is realized. 

(X‘) © (. X* ), 

ZP a X a 


given 
we have 


ZP b x b 


< i ba . 


(109) 

On the other hand 

( 110 ) 

(HI) 


Except in«the coincidental case where the equality sign holds, these 
limits will be worse than those deducible from the whole expansion 
path. They may be better or worse than the Lerner limits given 
in (104) depending upon the case selected. Note that it is im- 
possible for 

(X“) © (X‘) 

simultaneously with 

m © (X“). 

Hence it is not possible to derive lower limits to both indexes 
simultaneously. In fact, given 

(X 6 ) © (X“) 

and 

(X«) © (X*), 

it is impossible to compute either lower limit in this manner. 

If we widen our assumptions as to initial information, still other 
limits are possible. Knowledge of a third point may be utilized 
by the methods of the previous section ; as may also knowledge of 
any intermediate expansion paths. In fact, in the limit as all inter- 
mediate expansion paths are known, i.e., as we know the functions 

Xi h ip l, j Pm P ) » (^ 1) * * * * P) 

the indifference map itself may be solved for implicitly. 29 

It is possible to develop similar relationships between usual 
quantity indexes. Let us define our quantity index from the ath 

24 I do not discuss the effect of imposing the assumption of monotonic expansion 
paths, since there is no reason to rule out “inferior” goods. 
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situation to the 6th as follows: 


nat, _ E P b X b E P h X b £ P a X a P 

y E P 6 * 06 E P 0 X‘ E P a X ab ~ I a P b ’ 


( 112 ) 


i.e., this index is the ratio between the actual cost of the batch 
( X b ) to the cost of the cheapest bundle at (P b ) prices which would 
yield an equivalent satisfaction to that of ( X a ). 

Similarly, 


Q ba 


E P a X a I a 
E P a X ba ~ PI ba ’ 


(113) 


Obviously, if one situation is preferred to the other, the index 
number between them will be less than one, i.e., 



U(X a ) < U(X b ) implies Q ba < 1, 

(114) 

and conversely. 

Likewise 




U(X a ) < U(X b ) implies Q ab > 1. 

(115) 

Hence, 

Qla > 

1 implies < 1, 

(116) 

and 

Q ab > 

1 implies Q ba < 1. 

(117) 

More generally, 

Qab = 

1 implies Q ba = 1, 

(118) 

where the inequalities are to be taken in the indicated order. 

However, 


Q ai X Q ba t 

(119) 


except under special circumstances. The above relations are 
consequences of the concavity of the indifference loci. 

Of course in the absence of knowledge of the preference field, 
it is impossible to compute these index numbers, since we do not 
know (X ai ) and (X ba ) . From the last section we know that 

E P b b X<* =s E P b l X ab < E P b X ai < E P b u X ai . (120) 

Hence, 

E P b X h ^ -£P b X b ^ nab ^ E P b X b 
E P b b X ab = E P b l X°* y ^ E P b U X C 


( 121 ) 
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or 


E P h X b ^ YLP b X b 
£ p 6 x x a6 = £ P b l x°* v £ P b x a ' 


( 122 ) 


As before, (*Z ai ) can be computed only if we know the expansion 
paths. From (103) we find 


E P b X b pn? E P b X b 
p m b T.P a X a ' 

Therefore, 


(123) 


t~. KPN =- ZItN > 0 ^ > - “Lasoevre" f 1241 

#.* £ W - £ p* -X* y J > E " p y • !U4 ' 

Similarly,. 

^ EW ^ £P°X° _ 1 

£ P 6 Z 6 = £ P° Z 6 “ ^ * E P°2f 6 “ “Paache” 


As before, if we have 

m © m, 

or 

m © m, 

still other limits are possible. 

Now it is only in the case where the “Laspeyre” index is greater 
than one or the “Paache” index is less than one that it is possible to 
say which of the situations is better. But, and this is what makes 
this whole branch of theory a sterile exercise, it is always possible to 
determine this fact without any limits or without even introducing 
such index numbers. For our question is always answered, when 
it can be answered, by the previous analysis Of “revealed” prefer- 
ence, and the introduction of quantity and price indexes is indirect, 
unnecessary, and misleading. It is indirect and unnecessary be- 
cause it is a deduction from the previous simpler analysis. It is 
misleading because of the tendency to attach significance to the 
numerical value of the index computed. 30 There is not a single 
valid general theorem in the present field of index numbers which 
is not deducible from the analysis of the previous section. 

Of course, in the case of expenditure proportionality, certain new 
invariances do emerge. (1) It is always possible to derive double 

80 See the penetrating remarks of W. Leontief, “Composite Commodities and the 
Problem of Index Numbers,” Econometrica , vol. IV (1936). 
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limits; (2) The respective index numbers are reciprocals, i.e., 

QbaQab = 1 = JabJba^ ( 125 ) 

One last serious drawback to the present formulations of index 
number theory rests in the fact that these are expressed as ratios. 
If we admit productive factors as negative commodities, and for 
many purposes this greatly enlarges the generality of our analysis, 
denominators may vanish or change sign. Since all we wish is an 
algebraic comparison between value sums, it is unnecessary and 
undesirable to work with ratios. Instead the methods of the 
previous section are preferable. 

Pure Theory of Choice under Rationing 

c* 

Thus far in this chapter we have discussed the effects of general 
transformations of our dependent and independent variables. 
This led naturally into a study of composite commodities and of 
index numbers. At this point there is also suggested the interesting 
and important related problem of placing constraints upon the 
consumer in addition to that imposed by fixed total income. Of 
course, rationing involves such constraints. 

The simplest kind of rationing is that in which the government 
specifies the maximum amount of a particular commodity that each 
individual can consume. Sugar provides a common example, being 
relatively homogeneous and universally in demand. Ordinarily in 
rationing the individual is subjected to an inequality. He must 
not consume more than a given amount, but he need not consume 
as much as that amount. Of course, if the rationing is to be of 
any use, it will be applied to situations in which for many indi- 
viduals the allotted amount will actually be bought. Otherwise 
the law has no teeth, and individual behavior is left unchanged. 

The single individual maximizes ordinal utility as before except 
that he is now subjected to additional constraints of the form 

Xi^bi, Xj bj, •••, (126) 

where the commodities i, j, ■ ■ ■ are given maximum individual 
quotas of respective amounts per unit time, b iy bj, • • • . To present 
the conditions of equilibrium in these circumstances we must dis- 
tinguish carefully between various possible cases. The simplest is 
that in which the rationing quotas are so small that each is effective. 
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In this case, the equality signs will hold in the above equation. 
Here we lose as many degrees of freedom as there are commodities 
effectively rationed. In terms of the remaining commodities our 
conditions of equilibrium are just as before since each partial 
differentiation assumed other commodities constant. However, 
overall consistency is maintained by the fact that we write down 
partial derivative conditions only for those commodities which are 
not frozen at given levels. Thus, for each equation added in (126) 
we drop one equation of the form 

<Pi 4* \pi — 0, (127) 

replacing the equality sign by the inequality “greater than.” For 
only with respect to goods whose amounts can be voluntarily 
augmented will the marginal utility of their last dollar of expendi- 
ture be equated to the marginal utility of expenditure in all other 
lines. Goods which are arbitrarily limited at some quota have a 
marginal utility of expenditure which may and ordinarily will 
exceed that of the non-rationed goods. 

So far, we have been discussing the case where all quotas are 
effective. If any particular one is not effective, then the inequality 
rather than the equality sign is relevant, and we may disregard 
completely the fact that it is rationed and treat it exactly as we 
would an unrationed commodity. 

So much for the theory of single commodity rationing, a theory 
which is elementary and intuitively obvious. Before leaving it, 
however, one final point should be noted. The above analysis 
would suggest that the authorities could independently ration 
(n — 1) commodities but not all n. For if all commodities but one 
are rationed, the amount of the last would seem to be effectively 
frozen by the budget equation, and therefore would not be subject 
to the control of the government. Actually, this is not the case. 
The budget equation itself is not, strictly speaking, an equality. 
It gives the maximum that total expenditure can attain, not the 
amount it must attain. The authorities can ration all n com- 
modities even if this means that the individual cannot spend all 
his income. 

However, this raises a problem of terminology. Cannot money 
itself be counted as a commodity? Secondly, when we speak of 
total expenditure, do we include as one of its components “saving” 
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in the sense of expenditure on future goods? Upon reflection I 
think the reader will conclude that this is primarily a verbal prob- 
lem, much aggravated by the customary loose and ambiguous usage 
of the concepts of “money” and numeraire. 

Various conventions are open to us, any of which is satisfactory 
provided that it is adhered to consistently. We may include 
among our commodities goods of different periods of time, and we 
may assume certain expectations with respect to future prices. 
For the present purpose it is simpler to assume that total expendi- 
ture upon present commodities can differ from present income by 
the algebraic amount of saving without going into the problem of 
the form (cash, securities, etc.) that these savings take. If saving 
is determined by the usual preference calculus, certain implicit as- 
sumptions are made with respect to future incomes, prices, ordinal 
rates of time preference, etc. 

If it wishes to, a powerful government can arbitrarily limit the 
consumption of all present goods, and at the same time permit only 
an arbitrarily small amount of the surplus of income to be carried 
over for future uses. Ordinarily, in connection with a rationing 
program the State does not so limit saving, relying instead upon 
a personal tax program to bring about this desired result. Further 
discussion of this point may therefore be deferred. 

Single commodity rationing has certain shortcomings which 
give rise to a demand for some form of “point-rationing.” Instead 
of limiting the amount of a single commodity, the individual is 
limited to a weighted sum of a number of commodities, the point- 
prices providing the relative weights. But such a weighted sum 
represents nothing more than a single commodity in a new trans- 
formed set of variables. It is for this reason that the general 
theory of rationing belongs in the present chapter on transforma- 
tions and composite commodities. 

It would lead us into the field of welfare economics if we were 
to discuss the criteria used in classifying commodities into groups 
and the determination of their point values. Suffice it to say that 
administrative considerations, criteria of substitutability on the 
consumption and production sides, all contribute to the decision 
as to which commodities shall fall in the same group, what the 
relative values of each shall be, how many groups there shall be, etc. 
Moreover, although this is not strictly necessary, each commodity 
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is usually restricted to but one group so as to obviate the necessity 
of handling many kinds of tokens or stamps in making a given 
purchase. 

In addition then to the dollar prices, (pi, • • ■ , p n ), and money 
income, I, each consumer is confronted with r classes of point 
prices, (pi, ■ ■ •, pn), ( pi ", ■ ■■, pn"),, •••, (pi T , -, pn), and fixed 
totals of expendable points per unit time, (I 7 , I", • • •, I r ). Each 
of the sets of point prices will have zeros for most commodities, and 
for a given commodity all but one point price will usually be zero. 

This being the case, the consumer will maximize U(x u • • - , x n ) 
subject to the generalized budgetary constraints 

E pjXj^ I, E pi% = E PfXj s r. (128) 

11 1 

For the moment the matrix 3 will be assumed to be of rank 
(r-f-1). There will result optimal amounts of each good for each 
specified complete set of point-prices and expenditure allotment. 
We may summarize this result by writing down the generalized 
demand schedule foqeach good, written as a function of all prices 
and points and all total incomes or expenditures. Thus, 

== h*(Ph ' i Pn i Pi , ' , Pn * ' ' ' i Pi , ' > Pn J I, I , ' i I T ) • 

(* = 1, *••,«) (129) 

It is the purpose of the regulating authorities so to determine point 
prices and allotments as to result in “equitable” inter-individual 
amounts of consumption, appropriate total output, at appropriate 
dollar prices. In addition to such policy matters, the economist is 
interested in the purely positive problem of determining the prop- 
erties of these demand functions implied by the ordinal maximiza- 
tion process. 

For this purpose we must examine the maximum conditions of 
equilibrium. By the usual Lagrangean multiplier technique the 
first order conditions are easily seen to be 

U i \pi + \'pi + • * • + Vpi T = 0, (i = 1, •••,») (130) 

provided that all of the group allotments are “effective” as indi- 
cated by the presence of the equality signs in (128). If any par- 
ticular group allotment is ineffective for the individual in question, 
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its Lagrangean multiplier may be set equal to zero in the above 
equations. 

In words : the consumer will purchase each good up to the point 
where its marginal utility is equal to a weighted average of its 
various dollar and point prices, the weights being the marginal 
utilities of the last dollar or group ration coupons. 81 

For a regular maximum our necessary and sufficient secondary 
conditions are contained in the statement that the Hessian of the 
utility function must represent the matrix of a quadratic form 
which is negative definite under the {r + 1) linear constraints. 
This is equivalent to certain conditions on the matrix formed by 
bordering the Hessian with the matrix of the constraints, the 
transpose of the latter, and zeros. If we strike out vows and 
columns corresponding to each of (n — r -j- 1) commodities in 
turn, the resulting (n — r + 1) principal minors must oscillate in 
sign, the smallest one being negative, the next positive, etc. 
Thus let 

O', j = 1, • ••,m) 

(k,s = 0, •••,*-) (131) 

[m — (n — r + 1), • • • , n\ 

Then 

(- l) m A m > 0. (132) 

As throughout this book, the fundamental observationally 
meaningful restrictions on observable prices and quantities emerge 
as consequences of these secondary extremum conditions. In this 
case such implications can most simply be stated after the concept 
of a compensated price change has been suitably generalized. At 
least since the time of Slutsky it has been customary to deal with 
a compound change in price and income, where the latter is varied 
along with the former so as to leave the individual on the same 
indifference curve. 

When rationing enters the picture so that we have auxiliary 
constraints, the problem is made more complex. An increase in a 

31 This seems to have been pointed out in print for the first time by T. Scitovsky, 
“The Political Economy of Consumers’ Rationing,” Review of Economic Statistics , XXIV 
(1942), 114-124. Other theoretical aspects of rationing are discussed in H. P. Neisser, 
“Theoretical Aspects of Rationing,” Quarterly Journal of Economics , LVIX (1943), 
378-397. Also see N. Kalecki, “General Rationing,” Oxford Bulletin of Statistics, vol. 
III (1941). 
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given price, or in a given point price, will make the individual 
worse off. But he can be compensated for this in many different 
ways — by a change in money income, by a change in his point 
allotments of the same or of a different group, or by any combina- 
tion of these. It is most useful for the present purpose to consider 
that special compensated change in the dollar or point price of a 
commodity in which the individual is kept just as well off as before 
by means of a simultaneous change in the same total expenditure 
allotment (dollar or particular group point allotment). Let the 
change in the ith. good resulting from a change in the kth. group 
point price of the jth good when compensated as above be written 
in the form (dXi/dpj k ) com p .. Then regarded as an n by n matrix, 
with k a fixed number, this must be negative semi-definite of rank 
(n — r + 1). Thus, the compensated change in a good with re- 
spect to one of its own point prices must always be negative; in 
addition, we have inequalities on certain cross product and inter- 
action terms. These are expressed completely by the statement 
that the smallest naturally ordered principal minor of the above 
matrix must be negative, the next positive, etc., and finally the 
(n — r + 2)th and all higher principal minors must vanish. The 
last part of this statement is equivalent to the obvious proposition 
that the demand functions are homogeneous of order zero in each 
set of point prices and expenditure. 

If one wishes, off diagonal terms may be interpreted as general- 
ized complementarity coefficients, and it may be left as an exercise 
to the reader to show that these obey the usual symmetry rules. 
In fact, the reader may easily show that the matrix of compensa- 
tion terms is nothing more than the northwest partition of the 
inverse of the A„ matrix, after this submatrix has been premultiplied 
by the scalar (— X 4 ) 32 

The interested reader may also develop for himself the general- 
ized Le Chatelier-Braun principle introduced in chapter iff. In 
words, its economic significance may be summarized as follows: if 
in a given position of equilibrium a (compensated) change in price 
is made, the resulting change in amount demanded of that good 
will be greater if the individual is not subjected to the extra con- 
straints of rationing than if he is subjected to such constraints; 
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furthermore, the introduction of each new constraint will make 
demand still more inelastic. 

The tendency for demand to become inelastic under rationing' 
has a number of important implications for policy. In particular, 
it helps to explain why in a system which has already been subjected 
to numerous direct controls, fiscal measures aimed at sopping up 
superfluous income have effects which are of only secondary impor- 
tance. To explore this further would involve us in considerations 
of effective demand outside the scope of the present discussion. 

Most economists, will not be interested in the somewhat arti- 
ficial compensated change in point-price, but rather in the ordinary 
ceteris paribus change, in which other point-prices and expenditure 
are kept constant. It is easy to show by explicit differentiation 
of the equilibrium conditions, as well as from general considerations, 
that this may be split up into two parts: the compensated change 
and the income effect. Thus, 

(dXi/dpj k ) ct t. par. = (dx { /dpj k ) comp . — x 3 -(dXj/dI k ). (133) 

Except for (“generalized”) inferior consumption goods, the second 
term will be positive; consequently, a change in a good resulting 
from an increase in its point price must be negative in all normal 
cases. But if we have generalized inferiority, we may (but need 
not) have a generalized Giffen’s paradox, in which raising the 
point-price of a commodity causes more of it to be bought. 

Our discussion of rationing may be brought to a close with a 
few comments on the special mathematical features which dis- 
tinguish it from the general analytic case of any linear constraints. 
First, it is almost universally the custom to make all point prices 
positive, even though one could in theory imagine a case in which 
there were negative prices. (For example, ration tickets might be 
given to a consumer who will take certain redundant staples, etc.) 

Second, the authorities often set ration point-prices proportional 
to dollar prices. A special case of this is provided by dollar ex- 
penditure control on various commodities or groups of commodi- 
ties. An even more special case is that of over-all expenditure 
control, in which the individual is given a quota of dollar expendi- 
ture, ordinarily less in amount than his disposable income. If 
savings are not treated as one of the n commodities, this is equiva- 
lent to introducing linear constraints whose rank is less than 
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(r + 1); consequently the equations of (128) are inconsistent if 
equality signs are used. We must then have recourse to inequali- 
ties; of course, in this case the dollar budget equation must be 
relaxed in favor of the new quota allotment. 33 In rare cases where 
ration tickets or quotas are over-generous, the auxiliary constraint 
will become an inequality and the rationing will be ineffective. 

A different mathematical problem arises if the consumer can 
pay for a commodity by giving up a certain number of ration 
stamps of one kind or by giving up some of another. Depending 
upon the relative prices and scarcities of the different stamps, he 
will ordinarily choose to spend one rather than the other in a 
fashion reminiscent of Gresham’s Law. If he has similar options 
with respect to many commodities but at different relative ratios, 
he will spend a given kind of stamp on those commodities in which 
it is relatively advantageous in a manner formally identical with 
the classical theory of comparative advantage as applied to the 
determination of which of many commodities will be exported and 
which will be imported. 

The above case merges into that in which the different kinds 
of stamps can be converted into each other at rates determined on 
a black or white market, or by the government. Unless the 
government were explicitly to ban such transactions, there would 
inevitably arise trade in the different ration groups and money. 
Arbitrage would create ruling market ratios of exchange which any 
individual as a small consumer could not appreciably affect. Once 
the individual is confronted with given rates at which he can buy 
or sell each type of ration points, the problem is no longer one of 

33 Since in practice a commodity is not given more than one ration- point price, 
“degeneracy” cannot arise in any other way. If the authorities should require multiple 
point prices for a given commodity, unless care is taken the consumer may be put into 
the position of not being able to spend all his points. This difficulty may arise in two 
different ways: through true degeneracy and inconsistency in the auxiliary constraints, 
or from the fact that the admissible solutions of the linear equations do not yield positive 
amounts of all commodities. When this is the case, the first order conditions of equi- 
librium given in (130) are modified, but not as before by setting certain of the Lagrangean 
multipliers equal to zero, but rather by replacing certain equality signs by greater than 
signs. Any commodity which is not bought will have a marginal utility less than the 
above specified weighted average of point prices. The problems raised in this footnote 
are similar to those discussed by Schlesinger, Wald, Von Neuman, and Von Stackelberg 
in connection with the consistency and independence of the equations of the Walrasian 
theory of production in its simplest constant coefficient form. 
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many auxiliary constraints. Thus, if orange stamps can be bought 
or sold for five cents apiece, and if a commodity costs twenty-five 
cents and two orange stamps, it can simply be regarded as a com- 
modity which costs thirty-five cents. The consumer simply reverts 
back to the standard type of budget equation, modified only by 
the fact that his money income may be increased or decreased by 
the need to buy or sell ration points; or, what is simply another 
way of saying the same thing, the individual has higher income and 
is confronted by a new higher set of prices. If the dollar values 
of the first, second, • • •, and rth ration points are (&', b", ■ ■ •, b r ), 
the new budget equation becomes 

E (Pi + E b k Pi k ) Xi = I-f Z b k P. (134) 

1 X 1 

Actually, from a welfare point of view it can be shown that the 
free interchange of different kinds of stamps, among themselves 
and against money, is in a certain sense optimal. In fact, it is an 
indirect way of permitting individuals to exchange goods so long as 
the exchange is mutually advantageous. These last two sentences 
must be qualified. They must not be taken to mean that every 
individual will be better off if coupons or goods can be interchanged. 
Thus, if the rich buy redundant ration coupons from the poor to 
the mutual advantage of both, their expenditure by the rich will 
cause an increase in the point price of the scarce rationed goods in 
question to the disadvantage of the middle classes. 

Nevertheless, it can be shown that free interchange is optimal 
in the sense that its introduction accompanied by appropriate 
modifications in the point allocations to every individual could 
lead to an improvement for everybody. In the above example, 
the middle classes could be bribed into acquiescence, and there 
would still be a margin of advantage left for rich and poor. I do 
not imply by this that the middle classes should receive such a bribe 
since that would suggest a belief in the perfection of the previous 
status quo. Nor should it be thought that anything said here is an 
argument for making coupons interchangeable, since there might 
very well be in actual fact very grave difficulties in the way of 
devising a method of point allocation which would recognize the 
harm done to particular individuals. 



CHAPTER VII 


SOME SPECIAL ASPECTS OF THE THEORY OF 
CONSUMER’S BEHAVIOR 

Chapter v exhausts the content of utility analysis in its most 
general form, involving only an ordinal preference field. There 
remains in the literature a great number of discussions of particular 
problems which involve special and extra assumptions. In order 
to present a fairly complete account of the present status of the 
theory I propose to examine some of these carefully to show their 
empirical meaning. This involves a break in the unity of exposi- 
tion since each special assumption has often been made independ- 
ently of all others. There is no choice but to go through the list 
with no regard for continuity. Among the topics to be discussed 
will be the cardinal measure of utility, independence of utilities 
and measures of complementarity, and constancy of the marginal 
utility of money. 

It is clear that every assumption either places restrictions upon 
our empirical data or is meaningless. A price must be paid for 
any simplifications introduced into our basic hypotheses. This 
price is the limiting of the field of applicability and relevance of 
the theory because of the extra empirical restrictions to be im- 
posed on the data. Many writers do not appear to be aware of 
this; in any case few have indicated the costliness of their assump- 
tions or have adduced any evidence to support a presumption of 
their admissibility. 

There is a further serious difficulty. Despite the fact that 
developments in this field are not recent, and that mathematical 
methods of exposition have been employed, ambiguity still per- 
meates the contentions of many writers. This ambiguity can go 
unnoticed precisely because there has been so little interest in the 
operational significance of these assumptions. To put the matter 
somewhat harshly, ambiguously defined assumptions are used to 
give a semblance of deriving theorems which are themselves 
inconclusive. 
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A recurring source of difficulty in this connection which goes 
back as far as Marshall is the practice of introducing certain mathe- 
matical relations as alleged “approximations.” These are pre- 
sented as being valid in the neighborhood of a point of equilibrium. 
Even if such a relationship be admissible (as in the case of Taylor’s 
expansion), precisely because it can be applied to any and all 
properly continuous functions it is devoid of meaningful sig- 
nificance. Moreover, in common usage the restrictions which are 
introduced as allegedly holding only for a restricted neighborhood 
of equilibrium are in fact used to deduce results which follow only 
from an entirely different interpretation of these assumptions. 
We shall have occasion to enter more deeply into these matters at 
later points in the discussion. 

The Cardinal Measure of Utility 

We have seen from chapter v that all empirical market behavior 
is independent of the choice of a particular index of utility, or 
indeed of the choice of any measure of utility at all . 1 Nevertheless, 
many writers have wished to introduce the concept of a cardinal 
measure of utility, unique except for constants of scale and origin. 

Lange , 2 Fisher , 3 and others have contended that measurable 
utility, while superfluous from the standpoint of positivistic be- 
havioristic description, is necessary for the purpose of a normative 
science of welfare economics. 

While I cannot consider this problem in detail, it is well to point 
out that this is not at all necessary. Assuming that Welfare Eco- 
nomics involves comparisons between individuals, it is sufficient 
that explicit welfare judgments be made such that we are able to 
relate ordinally all possible combinations of goods and services 
consumed by each and every individual. Nothing at all is gained 
by the selection of individual cardinal measures of utility . 4 

1 Some writers have partially recognized this fact, but still maintain that the avoid- 
ance of the use of utility is a “stunt,” an axiomatic experiment by means of which we 
make our way more difficult. From the present operational orientation this view is 
clearly superficial. 

2 Oscar Lange, “The Determinateness of the Utility Function,” Review of Economic 
Studies , I (1934), 218-225. 

3 Irving Fisher, “A Statistical Method for Measuring ‘Marginal Utility' and Testing 
the Justice of a Progressive Income Tax,” in Economic Essays in Honor of John Bates 
Qlark (New York, 1927). 

4 Of course, if one considers general welfare as the algebraic sum of individual cardinal 
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In any case it is clear that there are an infinity of ways of select- 
ing a particular utility index and defining it as the true cardinal 
measure of utility. Thus, one might draw a straight line from 
the origin through a point representing an arbitrary bundle of 
goods and services. The numerical distance of any point on this 
line from the origin could serve as an index of utility, and might 
be dignified with a unique title. 


The Assumption of Independent Utilities 


A method of selection such as that outlined above is obviously 
arbitrary. Others are more subtle, but equally arbitrary. Pro- 
fessors Fisher 6 and Frisch , 6 employing only price and quantity 
data, haVe measured what is alleged to be marginal and total 
utilities and how these magnitudes vary with changes in income, 
quantities, etc. Although the exact technique differs between 
these two writers, the fundamental principle remains the same. 
This can be illustrated with reference to a simplified case where 
only two consumption goods are involved. 

Given a great number of observations on prices, quantities, and 
total income, one could in the limit more or less trace out the whole 
indifference map. We should still, however, have said nothing 
about the numbering of the one-parameter family of indifference 
loci so traced out. Both Professors Frisch and Fisher employ the 
following definition for selecting out a particular utility index to 
be designated the “true” measure of utility, subject to origin and 
scale constants. That utility index, if it exists, is to be selected 
which can be written in the form 


i.e., for which 


<P = /(*) + g(y); 

4 


ay 

dxdy 



( 1 ) 

( 2 ) 


utilities, then one will require the cardinal measurability of utility. But such an assump- 
tion is arbitrary and gratuitous. Cf. A. Burk (Bergson), “A Reformulation of Certain 
Aspects of Welfare Economics,’ ' Quarterly Journal of Economics , vol. LII, no. 2, Febru- 
ary, 1938, pp. 310-334. 

5 Fisher, op. cit. 

6 R. Frisch, "New Methods of Measuring Marginal Utility,” Beitrage zur okono* 
mischen Theorie , no. 3 (Tubingen, 1932); H. Schultz, The Theory and Measurement of 
Demand, pp. 111-117. 
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If "there exists one index of utility which can be written in the 
form of equation (1), then any other index which obeys the same 
law must differ from it only by a linear transformation. For, 
consider another index 


for which 


F = F(cp), 


<PF 

dxdy 


0 . 


(3) 

(4) 


Successively differentiating (3) partially with respect to x and y, 
we get 


d 2 F 

dxdy 

— F r <pxy + F rf <p x (py, 

<% 

(5) 

This, together with (2), requires that 



F"<Px<Py = 0, 

(6) 

or 

in 

p 

(7) 

Therefore, 

F = d -f- b<p y 

(8) 

where a and b are origin and scale constants respectively. 



It is clear, therefore, that the assumption that utilities shall be 
“independent” will help to select one utility index as the cardinal 
measure of utility. Nevertheless, even this convention is not in 
general applicable. It will guarantee us that we do not have two 
different utility scales, as has been shown in the above proof; it 
will not, in general, provide us with even one scale. 

If we assume an indifference field obeying the ordinary con- 
cavity restrictions and nothing more, then there will not, in general, 
be even one utility index which can be written in the form 

<p = f(x) + g(y), 

<Pzy — 0 * 

Let us write out one legitimate utility index. 

H = H(x, y). 

Does there exist a transformation F such that 


(9) 

( 10 ) 

(ID 


cp = F[H{x, y)] = fix) + g(y)? 
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The answer in general must be in the negative. Further arbitrary 
restrictions must first be placed upon the indifference field. 

Let the indifference field be defined in the following form, inde- 
pendent of any utility concept: 




. dx J H ^constant 


= R(x, y ), 


( 12 ) 


where R is a function of x and y obeying the following curvature 
requirements 

R x - R y R < 0. (13) 

The necessary and sufficient condition that there exist a utility 
index which can be written in the form 

<P = fix) + g{y), 

‘•Pxy — 0 


is as follows: 


or 


RR X y — R*Ry 
d 2 log R 


0 , 


dxdy 


= 0 . 


The necessity is verified by the differentiation of 

R( x _ f ( x \. 

[,y) g'iy) 


(14) 

(15) 

(16) 


The sufficiency is also easily indicated. 
If 

d a log R _ 
dxdy ~ 


0, 


log R = log h(x) — log k(y) = log 


h(x) 

k{y) 


where k and k are arbitrary functions. Form the expression 
Rdx + dy = dx -f- dy. 

This can easily be transformed into the exact differential 
d<p = h(x)dx + k(y)dy, 


(17) 


(18) 


( 19 ) 
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or 


r x r y 

<P = I h(x)dx + I k{y)dy + constant 
Ja J& 

= /(*) + g(y). 


( 20 ) 


We must now investigate the meaning of the restriction in (14). 
The assumption of independence of utilities in order to define a 
cardinal measure of utility is seen to involve (1) a convention by 
means of which one out of an infinity of possible utility scales is 
designated as the true cardinal measure of utility; (2) an arbitrary 
a priori restriction upon the preference field, and hence upon em- 
pirical price-quantity behavior. The meaning of this restriction 
we must now investigate. 

The functional restriction (14) is a partial differential equation 
of the second order of the general form 


M(R, R x , R y , R xx , R yy , R xy , X , y) = 0. (21) 


Subject to boundary conditions involving two arbitrary functions, 
it will serve to define a unique solution function 


R = R(x, y). 


( 22 ) 


More specifically, if we are given as empirical observational data 
the two expenditure paths corresponding to the changes in quan- 
tities with income in each of two respective price situations, then 
from these observations, and these alone, the whole field of in- 
difference curves can be determined by suitable extrapolation. 

It is not easy to visualize intuitively why this should be so; 
indeed, few economists would be so bold as to claim that the be- 
havior of an individual in all conceivable circumstances should be 
derivable from so few observations. And yet this is the conclusion 
to which we are forced by the apparently innocuous assumption of 
independence of utilities. 

Moreover, (14) places definite restrictions on our demand func- 
tions, the validity of which are equally dubious and equally im- 
possible to comprehend intuitively. For the simple two-commodity 
case our conditions of demand equilibrium can be written 


|r = R ( x > y)> 

r V 

I = P x x + P„y. 


(23) 
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These may be transformed into 



These demand equations must be subject to the restriction 


d 2 log jr 

£_v 

dxdy 


(25) 


When there are more than two goods, the restrictions implied 
by the very possibility of an independent index of utility take on 
a different and more complicated form. If there exists an index F 
for which 

Fij = 0, 0* ^ j) (26) 

then 

Fa = F'(<p) <p tj + F f (<p)<pi<pj s 0, (27) 

where <p is any other index. Thus, 


tpij 

<Pi<Pj 


= n<p), 


(i ^ j) 


(28) 


where T is an arbitrary function, and <p is any index of utility. 
Taking into account the (n — 1 )(n — 2)/2 conditions of integra- 
bility, this implies an additional n(n — l)/2 conditions. It is to 
be noted that these are identities, holding everywhere. Not only 
are they necessary, but the transformation 

F = f V eS° Tm9 dv (29) 


shows them to be sufficient as well. 

In terms of the indifference varieties these take still another 
form. Let 


pj d>0C"i 

p% dxj 



These, of course, satisfy the identities 


( 30 ) 
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If an independent, index of utility is possible, then we must have 
d 

-JZT~ = 0. (7 7^ k 7 * i) (32) 


In view of the n 3 relations of (31), the above n{n — l) 2 relations 
are not all independent. At most n(n — 2) are independent, and 
these may be written in the form 



0 ^ j), 0,j = 2, •••,«) 


(i = 3, • • • , ri) 


(33) 


These conditions are both necessary and sufficient. They im- 
ply among other things the ( n — 1 ){n — 2)/2 integrability condi- 
tions of equation (14) of chapter v. On the other hand, if the 
latter are postulated at the beginning, then equations (33) cease 
to be all independent and can be reduced in number. 

We have then n{n — 2) partial differential equations of the first 
order. Subject to an equal number of arbitrary functions, the 
general solution is uniquely determined. But empirically an ob- 
servation of an expenditure path involves (n — 1) functions. 
Hence, observation of more than n{n — 2)/(n — 1), or more than 
n expenditure paths, could be used to disprove the possibility of 
independence. 

In words, the implications of independence are that the amount 
bought of any good x, can be expressed in terms of its price pu 
the price of any other good pj, and the amount of expenditure upon 
these two goods alone, i.e., upon the amount left over out of total 
expenditure after all other goods are bought. Thus, the general 
demand function 

Xi — ‘ • pm I) 

can be written in the special forms 

h%pi, •••,#„,!) = h ij (pi, p Jt I + pjXi + pjXj - i p k x h ) (34) 

1 

for any i not equal to j. Subject to the conditions of (25) and 
certain consistency relationships, these conditions seem to be 
sufficient as well as necessary. 
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Except for R. G. D. Allen’s derivation of equation (15) for the 
two-good case, I am not aware that these full price-quantity impli- 
cations of independence have previously been derived. 7 However, 
fragmentary sets of necessary but by no means sufficient conditions 
for independent goods have been derived by Slutsky and by Milton 
Friedman. 8 They differ fundamentally from those above, and are 
of a species which I choose to call local implications of independ- 
ence. At a given point in the ( X ) space it may be possible to find 
a transformation F(<p) such that 

' Fa = F'cpij + F"cpi<pj = 0. (i y* j) (35) 

Thus, if there are only two goods (xi) and (x 2 ), this is always 
possible gt every point so that the local conditions degenerate into 
trivial identities. At any point ( X ) = (xi, x 2 ) , let 

F"(<p) _ MX) .... 

F'(<p) ~ tpx {X)MX) ’ (JOj 

and 

Fn(X) = 0. (37) 


This differs from the previous non-local conditions in that the 
above relation is an equality at a point and not an identity. 

In the general case of n goods a local transformation of this 
type is possible only if the following relations hold, 


MX) MX) 

MX) MX) ~ MX) MX) ’ * 3 * lj 


(38) 


These are [_n{n — l)/2 — 1] independent conditions, and as before 
need not be identities. Even if they hold everywhere, independence 
is not necessarily implied. If the local conditions hold, the bordered 
determinant of the type given in equation (34) of chapter v takes 
on a special form, and consequently certain restrictions upon the 
demand functions can be derived. 

Slutsky 9 shows that 

(39) 

7 R. G. D. Allen, “A Comparison between Different Definitions of Complementary 
and Competitive Goods,” Econometrica, II (1934), 168-175. 

8 Milton Friedman, “Professor Pigou’s Method for Measuring Elasticities of Demand 
from Budgetary Data,” Quarterly Journal of Economics, November, 1935, pp. 151-163. 

9 E. Slutsky, “Sulla teoria del bilancio del consumatore,” Giornale degli Economisti , 
LI (1915), 23-26. 
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where 


P = 


K u 

K 12 

••• K ln 

dxi 

di 

K,, 

K 22 

••• K in 

dx 2 

~dl 

K ln 

K 2n 

■Knn 

dX n 

di 

H 1 

H 

<T> 

dx-j 

dx n 

0 

di 

di 

IT 


(40) 


and the subscripts indicate appropriate cofactors. Also 

d log'(- X) 


= 

in the notation of chapter v. 

If an index exists such that 


dl 


at a point, then 

dp ~ 


Fij 33 0, (i ^ j) 

Pn Pi >i 


Pn+l,n+l.lZ Pn+l,n+l.ij 


. (i ^ j 5^ 1) 


(41) 

(42) 

(43) 


These are \ji{n — l)/2 — 1] independent and meaningful restric- 
tions on the demand functions and completely exhaust the local 
implications of the independence assumption. 

I should also like to point out that equations (43) might be used 
to determine the marginal utility of income if (1) independence is 
possible ; (2) that utility index which can be written as an additive 
sum of independent utility is defined as the “true” cardinal index 
of utility. I commend it to the attention of the never lacking 
army of utility measurers. 

Mr. Friedman has derived the following n(n — 1) local inde- 
pendence conditions: 10 

k :) ~~ Vjt) 


— _ ( — - ~ k£ML \ ] r hL &. 

Vii \ Vji 1 &iVii ) Vi i . 


kjVii 


( 44 ) 


In the two-dimensional case the restriction is vacuous. 


10 Friedman, op. cit. f p. 162. 


1 82 


FOUNDATIONS OF ECONOMIC ANALYSIS 


From the fact that their number is excessive, it is clear that 
they cannot all be independent. In fact, I am unable to prove 
that they are complete, i.e., equivalent to the above Slutsky condi- 
tions; indeed, I would venture the conjecture that they are not . 11 

It would be literally impossible for any individual to determine 
by introspection whether or not his demand functions satisfied the 
above relations. Indeed, the likelihood that such relations, se- 
lected arbitrarily from an infinity of possible functional relations, 
should hold is infinitely small. The minute amount of plausibility 


n On page 162, footnote 2, Mr. Friedman argues against the admissibility of the 
Pigou theorem that 


when 


ki 


= 0 


h 


(*) 


JHi V il 

Vii Vil VU Vil 

on the grounds that . . there is a presumption that mi and ki [my notation] are 
inversely related.” The whole problem is of course ambiguous until a particular set of 
indifference curves are specified. Nevertheless, within the realm of probability and 
presumption, Mr, Friedman’s statement seems to be incorrect. In chapter v it is 
shown that 


22 kiVH 

2> 


= 1; 


i.e., a weighted average of all income elasticities equals unity regardless of the number 
of goods. The average k is given by 


E h 


1 


This approaches zero as the number of goods increases. 

A survey of empirical budget studies will convince the reader that the income 
elasticities are distributed around unity, while the proportions vary around 1/n . 

See also A. C. Pigou and N. Georgescu-Roegen, “Marginal Utility of Money and 
Elasticities of Demand,” Quarterly Journal of Economics, vol. L, no. 3 (1936). 

The above Pigou theorem rests, nevertheless, upon the restrictive assumption of 
independence. If it could be shown that it holds for small values of ki/r\n in every case, 
it would be much more useful. 

That this cannot be true in general is shown by the following special case. Let 
consumption of n goods be completely “joint” in the sense that there are always fixed 
proportions. Then regardless of the number of goods, mi s 1, and mi fan 25 1* On 
the other hand, it is easily shown that s . Consequently, mi fan ~ fa/kj. 

Even though we make the k*$ become arbitrarily small as compared to the income elas- 
ticities, their ratio may take on any value other than unity. Thus, the Pigou theorem 
(*) is false. 

Of course, this example rests upon discontinuities in the higher derivatives, but I 
should think that we could approximate this condition by appropriate choice of con- 
tinuous derivatives. 
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to the proposition that independence is an admissible assumption 
comes from the fact that the question is usually posed in such a 
way that independence appears to be an intermediate class between 
the extreme classes of competitiveness and complementarity. Em- 
ploying a rudimentary concept of the equal probability of the 
unknown, one is inclined at first blush to acquiesce to the inde- 
pendence assumption. The error involved in so doing is obvious 
from our previous discussion, completely aside from the crucial 
ambiguity attaching to the older notions of independence and 
complementarity . 12 

Complementarity 


In my opinion the problem of complementarity has received 
more attention than is merited by its intrinsic importance. n Never- 
theless, as a result of the interest in this subject, crucial incon- 
sistencies in the thought of Pareto were revealed by Hicks and 
Allen, and much light was thrown on the cardinal and ordinal 
conceptions of utility. 

The older writers, Fisher, Pareto, and Edgeworth, suggested 
as a qualitative definition of complementarity between two goods, 
Xi and Xj, the sign of the cross derivative of the utility function; 
i.e., goods were complementary, independent, or competitive de- 
pending upon whether 


-i!sL.a 0 

dXidXj < 


(45) 


If one assumes only an ordinal preference field, all numerical utility 
indexes are equally admissible. This cross derivative is not in- 
variant in sign under a change in the index of utility. Consider 
a monotonic transformation of <p. 

U = F{<p\ 


dF 

d(p 


> 0 



<Ph 


(46) 


TT - d -l JlH 
Uij ” dp 9ij + d<p 2 <pi9 ^ 


12 It may be argued that regarded purely as a working hypothesis the facts do not 
sharply contradict the independence assumption. A little investigation reveals that 
such a hypothesis has not been tested from this point of view. On the contrary, It Is 
implicitly assumed from the beginning in the manipulation of the statistical data. 
Hence, one would have to go back to examine the original empirical data. It is interest- 
ing to note that observations on three expenditure paths would be sufficient to contradict 
the independence assumption in the case of two commodities. 
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By a proper selection of the arbitrary term, d 2 F/dcp 2 , the sign of 
the new cross derivative can be made to differ from that of the old. 
As a corollary of this lack of invariance, complementarity so defined 
will have nothing to do with the budgetary habits of individuals 
with respect to the two goods in question. Similarly, an indi- 
vidual’s introspective, spontaneous, intuitive designation of two 
goods as complementary or competitive (sugar and coffee, beef or 
pork, etc.) corresponds not to such a measure as this, but rather 
to behavioristic properties of the preference field and demand 
functions. 13 

In the course of the last five years the world’s best economists 
have spent considerable time and energy in the study of Professor 
Hicks’s Value and Capital , 14 It is perhaps symptomatic of the 
essential unimportance of the concept of complementarity that in 
this period no one seems to have noticed that the author gives two 
or more distinct (and inconsistent?) definitions of complementarity. 
The verbal definition in the text (p. 44) seems to be different from 
that in the mathematical appendix (paragraph 8). 15 


13 Hicks and Allen, Slutsky, and Schultz have suggested invariant measures of 
complementarity which are properties of the indifference curve system and demand 
function. Perhaps the simplest measure of complementarity between two goods, Xi 
and Xj, is the sign of 


Km 


dXi d%i 

1 T~ + x i Tf ~ 

dpi dl 


or 


= 


Ljjpj 

Xi 


If there are but two commodities, this must always be positive in sign; in the many 
commodity case at least one must be of positive sign in order to satisfy the relationships 

X/ PjKii — 0, (i = 1, * * *, ft") 

1 

Ku < 0 . 

Professor Leontief has suggested the following invariant measure of independence, 
corresponding most closely to the old notions: 

a*log(— ) 
a 2 w = a \Uj) _ 

dx%dXj dX\dXj 

Since the above lines were first written. Professor Hicks's two works, Value and 
Capital and Theorie Mathematiqus de La Valeur , have appeared. I still believe, however, 
that Hicks’s own solid contributions to economic theory do not rest on his treatment of 
complementarity, and that the extended discussion of the concept is a tribute to an old 
love rather than the necessary consequence of the subject matter. 

14 J. R. Hicks, Value and Capital (Oxford, 1939). 

15 Actually, there is still a third definition in the footnote on page 44. Only in the 
case of three goods is it clearly equivalent to one of the other two; in the general case it 
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This may be seen in many ways. While the mathematical 
definition can be applied to the case where there are only two goods, 
the literary definition cannot. (It is irrelevant that in the case of 
two goods the mathematical definition of complementarity admits 
of only one possible algebraic sense.) There is no reason why two 
distinct definitions should give the same answer in any particular 
case, so it is not surprising that one can invent examples ad inifini - 
turn for which two goods, such as wheat and linen, are on one of 
the definitions complements and on the other substitutes. But as 
we shall see in a moment, things are in even a worse state. Ac- 
cording to the definition of the literary text, it is possible for wheat 
and linen to be complements and substitutes at the same time, 
depending on the selection of the third good which is to> serve as 
numeraire. The definition is ambiguous; instead of reflecting the 
properties of two goods, it (or rather they) represents the properties 
of three goods. 

The mathematical appendix definition gives as a coefficient of 
complementarity between the it h and jt h good the element of the 
ith row and jth column of the matrix (^/) where 

x = *(p, U) (47) 

represents In matrix notation the demand functions, for each 
indifference level. 

Because of the homogeneity property discussed in chapter v it 
is clear that these n demand functions cannot be inverted to give 
each p in terms of the x’s and U. Only relative prices can be 
determined. We can use any one price as numeraire, discard one 
of the above equations and Invert the others so as to give (n — 1) 
relative prices in terms of (n — 1) goods and the ordinal level of U . 
It is most convenient to omit the quantity of the numeraire, x h > 
and its demand equation. This gives us 

~ = k N%x 1 , • • Sfc-i, #*+i, •••,*«, U). (i n) (48) 

ph 

is ambiguously worded, since the offer of the maximum amount of money by the con- 
sumer to attain given amounts of the two goods in question will change the amount 
bought of all other goods. It would appear that the author intends that all other (than 
the two goods in question and the numeraire good) goods are to be held constant, in 
which case it becomes identical with the verbal definition. The difficulty may reside in 
Hicks's unique use of the concept “money.” 
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Let (*JV/) == Nic be the matrix of this transformation. Then the 
element corresponding to the it h price and jth quantity represents 
the literary definition of complementarity. It can easily be shown 
that this, like the first matrix, must be symmetrical. In fact, it 
is necessarily negative definite, like the \p matrix. 

These last two statements are consequences of the definition of 
the inverse demand functions according to which 

N k = fa-'&rVij), (49) 

where ^ is defined to be the matrix formed by omitting the kth 
row and &th column from the original \p matrix. Since it is known 
from earlier chapters to be negative definite, so too will be its 
inverse. 

While the diagonal elements of the N and \p matrices must agree 
in sign, being negative, the off diagonal terms need not agree in 
sign except in the empirically unimportant case of three goods. 
Even more important, the element in a given row and column of 
the N matrix may be of different sign depending upon which good 
is called money or numeraire. A single instance will suffice to 
indicate this possibility. Suppose we have n goods in a preference 
field that can be typified by additive utilities. Let all but the last 
good have decreasing marginal utility. Then on the literary defini- 
tion the first two goods will be substitutes if the third good is used 
as numeraire ; unless the last good has decreasing marginal utility, 
when it is used as numeraire, the first two goods will be comple- 
ments. There is no reason why one of the goods, which we may 
call the last, cannot be an inferior good. Indeed, in the interesting 
borderline case where one of the goods has constant marginal 
utility, every pair of goods is independent on the literary definition 
when the good with constant marginal utility is used as numeraire; 
if any other good is used as numeraire, all other pairs not involving 
the good with constant marginal utility are easily shown to be sub- 
stitutes. But according to the definition of the mathematical ap- 
pendix, all pairs of goods not involving the one with constant mar- 
ginal utility must always be independent, whatever the numeraire, 

A typical element of Nk can be written out in terms of the slopes 
of the indifference loci, and the reader may verify for himself the 
dependence upon the subscript k. 
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Walras was scrupulously careful in his use of the concept 
numeraire. He distinguished between it and money, in the sense 
of the portable, divisible, cognizable, durable, conventional com- 
modity which served as a counter in performing exchanges. 
Marshall, when he spoke of money, ordinarily meant nothing more 
than income, as we shall see in the discussion of the constancy of 
the marginal utility of income. Hicks seems to alternate between 
the above uses, and a third sense in which money stands for a 
composite commodity made up of all but one or two of the total 
number of commodities. The result is a tendency toward am- 
biguity, of which the above example is only one instance. 

If complementarity is not of interest for its own sake, can it not 
still be of importance as an indicator of conditions whene certain 
“abnormal” phenomena are to be found? Thus, on page 71 in his 
discussion of the stability of a general equilibrium exchange system, 
Hicks says, “If income effects can be neglected, and if no comple- 
mentarity is present, then the system of exchange equilibrium must 
be stable.” It would seem, therefore, that the concept may be 
useful in indicating where a stable system will break down. Un- 
fortunately, this is not correct. The author has made a momentary 
slip, as he has since indicated elsewhere. If income effects can be 
neglected, then his matrix ( X TS ), being the sum of the negative 
definite symmetrical matrices of all individuals, must itself be 
negative definite and symmetrical. It must therefore be perfectly 
stable on the Hicksian definition, regardless of complementarity , 16 
If income effects cannot be neglected, then the matrix may be 
asymmetrical; according to the author asymmetry tends to make 
for instability. This is not quite correct. Pure asymmetry, and 
nothing else, tends to make for stability; the neglected income 
effects are the villains in the piece, and they do the most harm when 
their influence is not spent in creating asymmetry. 

Again in Hicks’ discussion of the effects upon the “period of 
production” of a change in the Interest rate, the concept of com- 
plementarity is introduced. 17 But here, too, a correct rendition of 
the secondary maximum conditions shows that complementarity 

16 Since these lines were first written, I find that Hicks has detected his error. CL 
j. R. Hicks, “Consumers* Surplus and Index Numbers,’* Review of Economic Studies , 
vol. IX, no. 2, p. 133, n. 2. 

17 Hicks, Value and Capital , p. 222, n. 1 and p. 328. 
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cannot be such as to make the Hicksian period of production de- 
crease with a reduced interest rate. 

The Hicksian average period of production is defined as the 
elasticity 18 of discounted value, C , with respect to changes in the 
discount rate, j8 = (1 + i)~\ Mathematically, it is given by 
(j8 dC/d(3)/C. Its precise usefulness seems not to have been ex- 
plicitly indicated. Among other things, the Hicksian “average 
period" has the anomalous property of not being an “internal 
mean," i.e., one whose value lies within the limits of the greatest 
and least of the time periods being averaged. Thus, it cannot be 
considered a generalization of the simple Bohm-Bawerkian average 
period of production. For example, consider the trivially simple 
“point-mput-pomt-output" case where 99^ worth of input is in- 
vested for one year to produce, say, $1.00 worth of output. The 
Bohm-Bawerkian average period, which unlike the Hicks concept 
requires a careful distinguishing between plus and minus items, is 
one year regardless of the interest rate. The Hicks period is 100 
years if the interest rate is zero and infinite if the interest rate is 
one per cent. In fact, for the so-called “marginal investment" it 
is always infinite. By introducing discount factors into the aver- 
age, the author hoped, perhaps, to meet the Knightian objection 
that the period is infinite ; but in many cases he seems only to have 
succeeded in making the finite infinite. 19 

But this is not the place to go into the deeper Knightian objec- 
tions to the Austrian theory. It will be sufficient, in passing, to 
state what seems to be the only essential theorem relating produc- 
tion planning to the interest rate. By the methods of earlier 
chapters, it can be easily shown that where the firm acts to maximize 

C ~ xo *4” Xi(3i + X2$% T* * * * d” x n $ n , (d0) 

18 As its author indicates, here is an example of an elasticity expression which is not 
dimensionless. For an analytic explanation of this the reader is referred to the first 
section of my chapter vi. 

19 Actually, it is not always necessary to introduce discount factors in order to make 
a process with infinite range yield a finite average. Many writers have shown how the 
distant elements receive smaller weights so as to create a convergent infinite series. 
Thus, imagine a perpetual stew, to which something is always being added and from 
which something is always being taken out, at random. Some part of what is now being 
added will never come out of the stew, just as some part of what is in the stew is of 
infinite age. But it is a simple exercise in infinite processes to show that the average 
age of the stew is finite, and the average expectancy of a particle’s staying in the stew 
is also finite. 
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a change in the discount factors must see the following inequalities 
realized 

E AxjAP , S 0. (51) 

0 

If the interest rates for all periods are equal, and if we go to the 
limit, we have the definite theorem 

E (dx,/dp)jp* = 0. (52) 

1 

Constancy of the Marginal Utility of Income 

I return after the above digression to an examination of another 
special restrictive assumption with which the analysis of con- 
sumer’s behavior is often burdened. For its own sake the problem 
of constancy of the marginal utility of income is one of the least 
important in economic theory, but it has given rise to an endless 
amount of discussion, most of it not on a very high level, and it 
therefore deserves brief notice here. Inasmuch as I have given a 
rather complete analysis of this matter elsewhere, 20 1 do not propose 
to do more than summarize matters here, and to comment upon 
some more recent contributions which became available after the 
writing of the cited essay. 21 

As will be seen in the next section, constancy of the marginal 
utility of income derives most of its importance from its relation- 
ships, real and alleged, to the subject of consumer’s surplus. In 
the beginning, it is well to point out that one ambiguity runs 
through the whole literature; sometimes constancy is interpreted 
to mean constancy, and at other times it is intended to mean 
almost constancy. The last usage generally involves some sort of 
limiting process; either the change in question is supposed to be 
“small,” whatever that might mean, or the percentage spent upon 

20 P, A. Samuelson, “Constancy of the Marginal Utility of Income,” Studies in 
Mathematical Economics and Econometrics: In Memory of Henry Schultz (University of 
Chicago, 1942), pp. 75-91. 

21 A. Henderson, “Consumer's Surplus and the Compensating Variation,” Renew of 
Economic Studies , vol. VIII, no. 2 (February, 1941), pp. 117-121. j. R. Hicks, ‘The 
Rehabilitation of Consumers’ Surplus,” same issue as above, pp. 108-116, J. R. Hicks, 
“Consumers’ Surplus and Index Numbers,” Review of Economic Studies , vol. IX, no. 2, 
pp. 126-137. Robert L. Bishop, “Consumer’s Surplus and Cardinal Utility,” Quarterly 
Jawmd of Ectiwmnm, vol. LVII, no. 3 (May,. 1943), pp. 421-449. 


FOUNDATIONS OF ECONOMIC ANALYSIS 



190 

some specified good is to be small, etc., etc. The problem is still 
more complicated because the alleged result, itself rarely worded 
unambiguously, is more often than not held to be “likely” rather 
than necessary. 22 I think, therefore, that we shall accomplish more 
if in the beginning we examine the literal implications of constancy 
of the marginal utility of income to show rigorously the empirical 
implications of this hypothesis. 

We have already encountered the marginal utility of income in 
chapter v in the form of the —X of equation (31). It is clear from 
the preceding equilibrium equations (29) that the marginal utility 
of expenditure upon each and every commodity must be the same, 
equal moreover to the marginal (rate of) utility derived from an 
extra doHar of expenditure distributed optimally over all of the 
goods. 23 

I have been meticulous in adhering to the usage of marginal 
utility of income, eschewing the more common terminology en- 
countered in the literature of the marginal utility of money . This 
brings up the second fatal ambiguity involved in the discussions of 
this subject. Money has many different meanings in theoretical 
discussions, ranging from an abstract non-metallic, non-paper, con- 
ventional unit of account, to specific counters, to commodities 
commanding wide acceptance in exchange, to any commodity taken 
arbitrarily as numeraire, to income or expenditure. It is reason- 
ably clear from everything that Marshall has written and from 
the cast of his thought that he definitely intended to convey the 
meaning of money simply as a euphemism for income or expendi- 
ture, reckoned in pounds or dollars. In his own words, it is “money 
or general purchasing power.” 24 In particular, it was foreign to 
his usage to think of money in the sense of numeraire, a concept 
which is not even listed in his index. 

This is a matter of importance when we come to ask the ques- 
tion, with respect to what is the marginal utility of income or 

22 This is another one of the numerous places where Alfred Marshall left matters in 
a fog. It was part of the flavor of the man not to bring things to a sharp focus. But 
what is forgivable in a genius cannot be tolerated among lesser mortals. 

23 This is simply one example of the YVong-Viner theorem discussed in chapter 
iii, p. 18. 

24 A. Marshall, Principles of Economics , 8th edition, p. 838. For a more complete 
discussion and for detailed citations the reader is referred to my essay in the Schultz 
volume. 
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money to be held constant? That Pareto might have interpreted 
this to mean constant marginal utility of a particular commodity 
chosen as numeraire, taken with respect to changes in the amount 
of that commodity, is understandable in view of the Lausanne 
usage. But that Hicks and others in the Anglo-Saxon tradition 
should have thought this to be the Marshallian meaning is much 
more surprising. Now at this date it is not a very significant ques- 
tion as to what Marshall himself really meant, but it is of impor- 
tance to show the implications of at least two distinct and incon- 
sistent meanings. Furthermore, many writers have definitely 
fallen into misstatements concerning the necessity and sufficiency 
of parallel indifference curves (the geometrical embodiment of the 
first of the above formulations) if various identities are to be 
realized. 28 

A closer examination of equations (29) and (31) will show that 
the marginal utility of income cannot be constant with respect to 
everything. It is a function of all prices and income. If we were 
to double all prices and income, the marginal utility of income would 
have to become halved. For such a doubling of income and prices 
would by the homogeneity property described in chapter v leave 
all physical quantities unchanged, and hence all real marginal 
utilities unchanged. But the marginal utility of income is given 
by dividing each real marginal utility by the respective price. 
With the numerator unchanged, and with the denominator doubled, 
the whole expression must be halved. If the marginal utility of 
income were constant with respect to all prices and also with 
respect to income, it would have to be unaffected by a simultaneous 
change in them all; since it must be halved by such a change, we 
have a contradiction. Consequently, the marginal utility of in- 
come cannot be invariant under changes in Income and each and 
every price. 26 

At most the marginal utility of income might be independent 
of all but one of these (» + 1) magnitudes. We can set n first 
partial derivatives equal to zero, but not (» + 1). Which n shall 

25 In addition there may be a misapprehension running through Hicks’s treatment 
in his book and the two articles cited. As Bishop has pointed out, the problem is not 
primarily one of whether or not income effects can be neglected. Further, there is the 
question of keeping real income constant or “adjusting” for changes in real income as 
seems to be implied in certain of the Hicksian statements. 

26 See Studies in Mathematical Economics and Econometrics , p. 76. 
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we choose? This can clearly be done in in -f- 1) alternative ways. 
One of these involves the constancy of the marginal utility of in- 
come with respect to n prices, but not with respect to income. 
Elsewhere I have argued that this is the pure Marshallian case. 
The other alternatives involve the constancy of marginal utility 
of income with respect to income and with respect to (n — 1) 
prices, or with respect to all prices but one. 27 This is the second 
hypothesis of constancy. 

In the Schultz volume essay I worked out the implications of 
each of these distinct hypotheses. In terms of everything that 
has ever been observed concerning price and budgetary data, the 
implications of each are highly unrealistic, although it is the second 
of these which leads to really fantastic conclusions. 

Before summarizing these empirical implications I should first 
point out that the marginal utility of income, being from a formal 
point of view a Lagrangean multiplier, must by the results of 
chapter vi, p. 132, not be left unchanged if we go from one utility 
index to another, as is our privilege in an ordinal preference field. 
At each point, for each different utility indicator, we have a differ- 
ent marginal utility of income. If the expression is constant for a 
given movement when we are using one indicator, it will not be 
constant if we use another indicator. If, to get rid of the minus 
sign, we define a new expression to be equal to —X, it is really 
necessary to indicate the cardinal index of utility to which it applies. 
We may represent the new expression by the letter m, and append 
a subscript to indicate the utility index in question. 

Then, 

Wip — TWip(j?h p2, ' , Pn j ■T) = X = f(p 1, p2, ’ 1 Pn t -O' (53) 

As shown in the Schultz volume this function must be homogeneous 
of order minus one ; by Euler’s theorem on homogeneous functions 
the sum of its elasticities with respect to each of the independent 
variables must equal minus one, so that each cannot vanish. 

27 The good whose price changes have an effect upon the marginal utility of income 
need not be the commodity selected as numeraire, and it might prevent confusion if we 
made a point of selecting some other commodity as “money/’ especially since there is 
no special reason within the scope of static theory why this special good should be 
divisible, durable, and have all the other properties of money. Nevertheless, in the 
literature the convention is followed of picking this good as numeraire. 
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If we subject our utility index, <p, to a monotonic transformation 
to give us a new index, U — F(<p), in terms of our new utility 
index the marginal utility of income has the properties 

mu — F'(<p)m v , 

dmu F ,dm* „ dip (54) 

da r da ' P da ^ 

Since we can make F" of any sign, the marginal utility of income 
can be made to change in any direction and amount we like, except 
in the special case which I have discussed elsewhere. 

Have we destroyed then all possibilities of speaking of constant 
marginal utility of income? Not quite, since we need only assume 
that there exists some one cardinal index (even if we ourselves pre- 
fer to use another cardinal index, or no one cardinal index) for 
which the strong, non-local restrictions of the two different hy- 
potheses hold. 

It is a mere exercise to show that the first, or pure Marshallian, 
hypothesis implies that the income elasticity of demand for each 
good must be unitary, and the price elasticity of demand for each 
good in terms of its own price must equal minus one. Moreover, 
the demand for each good is independent of changes in the prices 
of all other goods. There must exist one way of numbering the 
indifference curves so that utilities are additive and independent 
in the old sense ; actually except for arbitrary origin and scale the 
utility function is a linear combination of terms like ki\ogXi, 
where the k coefficients represent the unchanging proportions spent 
upon each commodity, x. These implications are both necessary 
and sufficient. 

In view of the well-known Engel’s laws and numerous budgetary 
studies it is hardly necessary to point out that this flies in the face 
of all observable reality, even as a first approximation in the sense 
of limitingly small changes. 

Performing a scarcely more difficult exercise in connection with 
the second hypothesis, it can be shown that it implies that the 
demand for each good except the numeraire depends only upon its 
own price in relation to the numeraire’s price, and is entirely 
independent of income, money or real. The demand for the 
numeraire depends in a corresponding way upon all prices and 
income. These conditions are both necessary and sufficient. 
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They imply among other things that the indifference varieties are 
parallel, their slopes being constant along the direction of change 
in the numeraire good alone. 

Of course no one has observed, and presumably no one ever 
will, a preference pattern in which all of extra income is spent upon 
one commodity . 28 Note that this is not even approximately true 
for instantaneous rates of change even when we neglect “second 
order of smalls.” 

Before we go into the reasons why anyone should ever have 
wished the marginal utility of income to be constant, a word may 
be said concerning the quasi-constancy of this magnitude. Of 
course, for a very small change in the initial situation, the marginal 
utility ok income will change very little, as with everything else in 
the system. But this is simply a consequence of continuity, and 
nothing else. It is quite another thing to say, and this is the 
only important thing, that the rate of change of marginal utility of 
income is small. Today the use of infinitesimals and of differ- 
entials is rather out of fashion with mathematicians, although a 
completely rigorous basis can be given for these processes. But 
in the Victorian era when Marshall was in his prime, these were 
old familiar standbys, and we may be sure that the man who was 
second wrangler to the great Lord Rayleigh was well at home with 
their use. It is strange, therefore, that he contented himself with 
gratuitous, and I believe incorrect, statements that changes in the 
marginal utility of income were of the “second order of smallness.” 29 
In the technical sense they are not; the fact that we are dealing with 
what can be regarded as a second derivative is not relevant here. 

If it may be of comfort to anyone cherishing a fondness for 
constancy, the following formula may be given indicating how the 
relative amount of expenditure upon a good ties up with the elas- 
ticity of the marginal utility of income with respect to a change in 

28 Perhaps King Midas was an exception, although his case involves certain dynamic 
considerations outside the present scope. For a one commodity maniac of his type it 
might not be inappropriate to measure all values in terms of the commodity in question; 
thus, the food necessary to keep a book collector alive might be reckoned as simply some 
fraction of a first edition of Adam Smith. 

29 Marshall, Principles , p. 842. The small germ of truth in the argument concerning 
“second order of smallness” lies in the fact that if each of the half dozen different 
consumer’s surplus concepts are plotted again ts a variable price (later called pf) t then 
at the original point, pp, they all have the common tangent, — g*. 


SPECIAL ASPECTS OF CONSUMER THEORY 


195 


the price of this good. 


Em t _ , / Em 9 Ext \ 
Epi ~ ki V El + El } ‘ 


( 55 ) 


Providing that the two expressions in brackets do not simultane- 
ously become large, and the last of these must for all goods average 
out to unity, then our left-hand elasticity must go to zero in the 
limit as the per cent spent upon the good in question goes to zero. 
But then of course any “surpluses” connected with this good 
become small, and there is no reason to distinguish between the 
orders of smallness. 


Why Consumer’s Surplus is Superfluous 

Of course, the problem of constancy derives its interest, I will 
not say importance, from the Marshallian concept of consumer’s 
surplus, about which an earlier generation of economists were able 
to indulge in much argumentation. Since many of the points in- 
volved were essentially mathematical in character, and since most 
of the antagonists did not go beyond literary methods, the discus- 
sion was not able to advance beyond a certain point, although most 
of the essential difficulties with the concept had been brought out. 
Later when there came about a renascence of mathematical meth- 
ods in economic theory, economists had lost interest in the problem, 
and the subject lived on primarily in the elementary textbook and 
in the classroom. 

In the present writer’s opinion this is as it should be. The 
subject is of historical and doctrinal interest, with a limited amount 
of appeal as a purely mathematical puzzle. These statements are 
made in cognizance of the fact that Professor Hicks has recently 
attempted to rehabilitate the doctrine of consumer’s surplus. It 
would carry us afield from the present task to analyze his conten- 
tions in detail, and so only a few ex cathedra remarks must suffice. 

In the first place, any judgment as to the usefulness or lack of 
usefulness of consumer’s surpl us has nothing to do with the problem 
of the admissibility of welfare economics as a significant part of 
economic theory since nobody has ever argued that the latter sub- 
ject presupposes the validity of consumer’s surplus. Can it then 
be said that consumer’s surplus if not necessary, is nevertheless a 
useful construct? Concerning this psychological question, no final 
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answer can be given. Historically the important propositions 
concerning increasing and decreasing cost industries, which are 
attributed to Marshall’s consumer’s surplus notions, may be said 
at best to have been incomplete derivations, and at worst may be 
said to be absolutely incorrect statements which, by a pun or play 
on words, seem to resemble the Pigouvian doctrine concerning in- 
dustries with external economies and diseconomies. In its earlier 
form the Pigouvian doctrine is close to that of Marshall, but from 
the writings of Knight and Pigou himself we know that earlier form 
to be quite wrong . 30 To Pigou the problem is not at all one of 
increasing or decreasing returns; it is only a question as to whether 
each unit is taking account of its full effect upon social magnitudes 
(other than prices) , or whether it is not. If it is not, and that is all 
that we mean by external economies, then there is of course need 
to interfere with the “invisible hand.” I have found nothing in 
the written work of Marshall which suggests that he ever saw 
matters in this way, and even if he had stumbled upon this result 
by means of consumer’s surplus, it would not be the first time that 
a correct theorem had been reached by incorrect, heuristic reasoning. 

Also it may be said that the merits or demerits of the concept 
in question have little to do with the applicability of partial equi- 
librium methods to any particular problem. As for its connection 
with the theory of index numbers, after the concept has been 
renovated and altered, it is simply the economic theory of index 
numbers in the Pigou, Konus, Haberler, Staehle, Leontief, Lemer, 
Allen, Frisch, Wald tradition . 31 

80 F. H. Knight, “Fallacies in the Interpretation of Social Cost,” Quarterly Journal 
of Economics , XXXVIII (1924), 582-606. A. C. Pigou, The Economics of Welfare, 
4th edition (London: 1932), chap, xi and appendix iii. 

81 In the Hicks derivations {Review of Economic Studies , vol. IX, no. 2, pp. 126-137) 

certain well-known theorems which are exact are derived as approximations. Also his 
most interesting result, that the difference between the Laspeyre and compensating 
variation is equal to a generalized substitution term, is exactly true, not simply for small 
movements, being a transcription of the familiar notion that the two terms differ in 
consequence of the curvature of the preference field. The one application to welfare 
economics in section 8 would be of interest only in connection with a (misdirected) 
attempt to measure welfare in a cardinal sense: to say whether one movement is better 
than the sum of the benefits of two other movements. And even if one were interested 
in cardinalization of welfare, this would not be the way to do it, for it can be shown that 
the value sums which are used in index number theory are of importance only for the 
qtiallatfvte dferiep^dn df dhassge they' indicate; m general (except m the trivial' case 
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If one were to begin afresh to give answers to the following 
problems, in no one of them would consumer’s surplus be necessary 
or desirable : Should Robinson Crusoe, a Socialist state, or a capital- 
ist economy build a particular bridge? Should indirect taxes be 
preferred to direct taxes? Should discriminatory prices be allowed 
if a uniform price will not keep an activity in business? Should the 
number of firms producing differentiated products be reduced, and 
in what way? Should a particular small industry be expanded or 
contracted by means of tax or subsidy? etc., etc. Aside from their 
extraneous inter-personal aspects, all of these questions can more 
conveniently (and more honestly !) be answered in terms of the con- 
sumer’s ordinal preference field. 

It is for these reasons that my ideal Principles would not include 
consumer’s surplus in the chapter on welfare economics except 
possibly in a footnote, although in my perfect Primer the concept 
might have a limited place, provided its antidote and alternatives 
were included close at hand. 

The Many Forms of Consumer’s Surplus 

The Marshallian expression of consumer’s surplus does not refer 
to any one thing but to at least half a dozen interrelated expres- 
sions. It is only too easy to accept tacitly one of these as primary 
and then to show that the others do not correctly measure this 
magnitude. Thus, Professor Viner 32 argues that Marshall is in- 
correct in using the area under the demand curve as an index of 
the gains from trade because this magnitude does not coincide with 
the amount which could be derived by an all or none offer, as if 

of expenditure proportionality) they cannot constitute even an arbitrary cardinal index. 
Some of these implications will be seen from an application of the Hicks result to what 
I have called the pure Marshallian case. Let us suppose that utility “really” is measur- 
able in a cardinal sense and is given by the additive logarithmic form mentioned above, 
so that demand for each good is unitary and independent of all other prices. Any two 
goods will nevertheless be substitutes in the Hicksian sense; if it seems strange that 
independence in the usual objective sense nevertheless implies substitutability in the 
latter sense, the reader is reminded of the formal definitions laid down by Hicks. Be 
cause each good is unaffected by a change in the other price, a change in both prices 
together leads to exactly the same cardinal change in utility as the sum of each change 
separately. Yet by the Hicks theorem of section 8 the combined reduction in price leads 
to a smaller “gain” than the sum of the two separate “gains," an eminently gratuitous 
conclusion. 

32 J. Viner, Studies in the Theory of Interwattiumal Trade (New York; 1937), chapter ix, 
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the latter were the primary and correct expression for consumer's 
surplus. As Bishop has shown , 33 the latter may be a worse ap- 
proximation than the former to the cardinal utility gain, which 
Bishop takes as the primary measure of consumer’s surplus. 
Hicks first took the all or none magnitude as being the primary one, 
but now adopts a fourth magnitude as primary, what he calls the 
compensating variation. Which is correct can only be a question 
of history of thought; which would Marshall have selected if con- 
fronted with a one or none offer? Unfortunately this is no longer 
an observably meaningful question. Although I think Marshall 
would have agreed with Bishop, looking upon the other measures 
as good or bad approximations, I prefer to treat no one as primary 
or privileged, but to give the relationships between each. 

This was done in brief but fairly exhaustive fashion in the 
Schultz volume essay, from which I append the following few' pages, 
copied verbatim except for minor changes . 34 

Before examining the Marshallian concept of consumer's sur- 
plus, let us consider the uses to which it is put. Among other 
things it is proposed as a measure of the gain (loss) of utility that 
results from a decrease (increase) in price of a single good. An 
attempt also has been made to apply it to the analysis of the burden 
involved in commodity taxation. It has been used to determine 
the maximum amount of revenue that a perfectly discriminating 
monopolist might exact from the consumer for a given amount of 
the good in question. 

Since only an ordinal preference field is assumed in the theory 
of consumer’s behavior, there is really little importance to be at- 
tached to any numerical measure of the gains from a price change. 
In particular, one cannot fruitfully compare the gain derived from 
a movement between tw r o given price situations with the gain be- 
tween two other price situations . 35 Moreover, all valid theorems 
relating to the burden of taxation can be stated Independently of 
any numerical measure of utility change. We should not be greatly 
perturbed, therefore, if the concept of consumer's surplus should 

33 Op. cit., p, 421, passim . 

34 Pages 87-90. 

35 One can, however, compare the gains derived from a change in the basic price 
situation with an alternative price change from the same basic situation, since this 
resolves itself into an ordinal comparison of the alternative new situations. The initial 
situation “cancels out” so to speak. 
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be found to be inadmissible. Its only advantage seems to lie in 
its deceptively easy two-dimensional representation. 

Consider an initial price and income situation, (pp, • • -, p n a , /*), 
and the corresponding amount of goods purchased, (xp, • - x n a ). 
For any selected utility index, <p, there will also be a given amount 
of utility, (p{X a ). Suppose that a change is made in but one price, 
Pi, and income is left unchanged. There will be new' amounts of 
every commodity, (.%'A • • •, x n b ), and of utility, <p{X h ), correspond- 
ing to the new prices and income, (pi b , • • •, p n b , I b ), or ( pi a , • • •, 
Pt’y • • • , pn a , I a ), where p? is less than pf. 

We are interested in the following magnitudes: 

1. The gain (loss) in utility resulting from the price change, or 

«,(*>) - ¥>(*“). 

2. The area between the demand curve of the zth good and the 
pi axis within the range of the price movement, or 



3. The amount by which the expenditure on the fth good in 
the new situation is exceeded by the maximum amount of money 
which the consumer would be willing to pay for in preference 
to trading at the old set of prices. (This may be negative if we 
are dealing with a price increase rather than a decrease.) Call 
this b Eab. 

4. The amount of extra income which the consumer would in- 
sist upon if he is to be as well off as in the new situation while 
consuming the old amount of Xi. Call this a E a b. 

5. The change in income which will make trading at the new 
set of prices as attractive as trading at the old set of prices with 
the initial income. Call this 6 A/ a &. 

6. The change in income which will make trading at the old set 
of prices as attractive as trading at the new set of prices with the 
initial income. Call this a Al a b. 36 

According to the Marshallian doctrine of consumer’s surplus, 
all six of these magnitudes are equal except for dimensional con- 

36 Interchanging subscripts changes algebraic signs. Thus, 

b Alab = ~ h Alab. a Alab , but not 3 4 5 6 A lab nor b E a b, can exceed I. 
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stants. We are explicitly warned, however, that his doctrine holds 
unqualifiedly only when the marginal utility of income is constant, 
and only if utilities are independent. I shall now examine the 
value of each of these magnitudes in four cases: (a) in the general 
unrestricted case of stable demand ; (b) under the first interpreta- 
tion of constancy of the marginal utility of income; (c) under the 
second hypothesis when the ith good is not the numeraire; and 
(d) under the second hypothesis when the ith. good itself has con- 
stant marginal utility of income. Only the most sketchy proofs 
will be indicated. 

In the general case we have the following relations: 



' dcp d%j 

v dxj dpi 

^ dpi 


r*i b n ( dx, \ 

C p i h 


(56) 

po dpl ~ 

Jfya 

m^Xidpi. 

n n 

b Al ab = max (X) Ps b Xj b — X Pi b Xj), where 

i i 

<p(X) = 

- vm. 

(57) 

= E PKxj b - xt), 

1 



(58) 

= E (Pf - P: b )x S a . 

1 



(59) 

If only the ith price changes, this becomes 




b A I ab S ( pf - pi b )Xi a . 



(60) 

Similarly, 




n n 

11 Al ai = min (E PT x j — E PT x f)< where 

i i 

<p(X) 

= v(X b ) 

(61) 


“A lab = X pj a (xj b — XT), (62) 

i 

and 

a AI a b = E pf(xj b - xt) = ( pT - pi b )xi b . 


(63) 
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It is impossible in the general case 87 to determine the relative 
magnitude of b AI ab and a AI a b, or of h E ab and a E ab . 

b Al ab = a AI ab ; b E ab = a E ab . (64) 

It can be shown in the two dimensional case that 

r v i 1 ' dx- 

b E ab = - i xdpi + f (pi — pi) WT- 1 dpi, (65) 

where pi is the price which would have to prevail for the consumer 
freely to select the batch of goods which he actually does consume 
when presented with an “all or none” offer by the perfectly dis- 
criminating monopolist. The first term on the right-hand side of 
equation (65) is the area under the demand curve. The second 
“correction” term may be of either sign. 38 It also follows from 
the definition of “A I ab that 


b Al ab = b E ab and a Al ab — a E ab - 


(66) 

In case (b) we find 



b E ab < b AIab < a A I ab S a Eab, 


(67) 

and 



•a/. > ~ **•> - - f 

VYlq 

(68) 

The following relations must be satisfied in case 

(c): 


m, 9 J Pi a r 



== ^JtLab ^ b&Iab “ a &Tab ^ a '-2ad* 

(i 1) 

(69) 


37 If we rule out the inferior good phenomenon so that demand is “normal,” 
& A T a b < a Alah and h Edb < a E a h. 

Actually, 

'Ala - “Alab = LXi(pl a , Pn a , <P) - Xi{pf, ■ ■ ■, Pi, ■ ■ Pn% 



For sufficiently small changes in price, concept 2 will always be halfway between any 
corresponding pair of A’s, or E 1 s, etc. 

38 In the “normal” two-dimensional case it will be of negative sign; i.e., a perfectly 
discriminating monopolist will be able to exact less than the area under the demand curve 
from the consumer. 

m The last of these inequalities will certainly hold in the two-good case. I have 
not developed a satisfactory proof that it holds for the w-dimensional case. 
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Although this is not the Marshallian interpretation, consumer’s sur- 
plus seems to be most justified in this case. However, the above 
equalities cannot hold simultaneously for every good. 40 

For case (d) equation (67) must hold with the possible exception 
of the inequality referred to in a previous footnote, but (69) is 
definitely false. 

Other results may be extended in at least two directions. More 
than one price may be permitted to change, and also income, with- 
out changing many of the inequalities. 41 This may be left to the 
interested reader as an exercise. Also we may attempt to derive 
rigorous inequalities in the n commodity case. This offers con- 
siderable complexities in connection with concepts (3) and (4). 

40 Case (c) is sufficient to insure the equalities of equation (69). Some of them may 
hold in other cases. 

41 In general, line integrals will replace simple integrals with the path of integration 
of the former a matter of no consequence. 


CHAPTER VIII 
WELFARE ECONOMICS 


Beginning as it did in the writings of philosophers, theologians, 
pamphleteers, special pleaders, and reformers, economics has al- 
ways been concerned with problems of public policy and welfare. 
And at least from the time of the physiocrats and Adam Smith 
there has never been absent from the main body of economic litera- 
ture the feeling that in some sense perfect competition represented 
an optimal situation. Of course, over time the exact form of this 
doctrine has undergone modification (not always in any one direc- 
tion), and there is considerable diversity in the attempted proofs 
(in the amazingly few places where rigorous proof was attempted). 

Although this doctrine is often thought to be conservative or 
reactionary in its implication and to reflect the “kept” status of 
the economist, it is important to emphasize that it was “radical” 
in the eighteenth century, and there is some evidence from events 
of the last decades (e.g., the T.N.E.C. and economists’ role and 
views with respect to Anti-Trust) that it has become a thorn in 
the side of what are usually thought of as conservative interests. 
Furthermore, some Socialist writers, who in their youth became 
interested in analytical economics, find in this doctrine a possible 
device for expediting planning in a socialized state. 

Early uncritical allegiance to this doctrine arose in part from 
the understandable eighteenth-century tendency to find teleological 
significance in the workings of what is after all an equilibrium sys- 
tem which is not devoid of aesthetic content regarded simply as a 
mechanism . 1 But it would be unfair to the older economists to 
believe that their case ended with a simple argument from design, 

1 It would be out of place here to discuss the relationship of this doctrine to that of 
“natural rights”; to that of competition as an immutable law with which man cannot 
interfere even if he should wish to; to the inverted doctrine of natural selection whereby 
the results of competition were judged to be best by means of a circular definition of the 
“fittest” as those who survive; to the Malthusian view that hardship and competition 
are necessary to bring out the “best” in a man; to the view that competition was good 
enough for our predecessors and therefore good enough for us; and other arguments 
designed to preserve the status quo. 
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even if such a charge can be sustained with respect to certain 
Epigoni. 

This can best be seen in the writings on International Trade 
where the issue of the tariff brings out most clearly the welfare and 
policy beliefs of economists, even down to the present day. For 
free trade is but one dramatic exemplification of pure competition, 
and in this field formal attempts at proof were made, or we can at 
least in many cases piece together the implicit beliefs of the author. 

1. Perhaps the most common reason for believing competition 
to be optimal stemmed from the recognition that no party could be 
hurt by exchange as compared to his position before trade, since 
he could always refuse to trade. Thus, trade is better than no trade; 
exchange is mutually beneficial; one party does not gain what the other 
loses. If we examine the argument carefully, we find that it does 
not really imply that pure competition is optimal, even though 
properly interpreted it can provide a case against prohibitive tariffs. 

2. A second more sophisticated argument, which includes the 
first and more, rests upon the fact that the equilibrium position 
reached in pure competition represents an optimum for each indi- 
vidual, consistent with his original endowment of commodities and 
the market situation with which he is confronted. But every indi- 
vidual may be making the best of a situation without implying 
that that best is very good or is optimal ; although each individual 
in pure competition takes price as given, for the market it is a 
variable, and it is quite possible that conditions other than pure 
competition might lead to better results in terms of any of the 
usual ethical notions. But, leaving aside all ethical notions, is it 
not equally clear that under (say) monopoly, both buyer and seller 
are doing the best that they can for themselves under existing 
market situations? The only distinguishing feature of pure com- 
petition, as compared to any other mode of behavior, is that the 
market conditions facing each individual are taken (by him) to be 
“straight lines” involving trade at unchanging price ratios. And 
it is precisely the question of the sense in which this is optimal 
which is left unanswered. 

It does not appear that Walras ever reached beyond this second 
stage of the argument. 2 His cardinal failing consists not so much 

2 Compare the very penetrating remarks of Wicksell on this point. K. Wicksell, 
Lectures <m Palvtwai Ecemomy (English translation, New York; Macmillan, 1934), 
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in the fact that he jumps from incomplete premises to sweeping 
conclusions, but in the fact that he is satisfied with this very limited 
kind of an optimum, which by a play on words he seems to confuse 
with the more usual and important senses in which perfect compe- 
tition is conceived to be optimal. 3 

3. Still a third stage of reasoning attempts to show, not that 
each individual is made best off by competition since this is im- 
possible unless each can take all, but that in some sense the sum 
total of satisfactions is maximized, that perfect competition effects 
an ideal compromise of mutual benefit, or in its most nebulous 
form that free trade (perfect competition) maximizes world (all 
individuals’) income. Of course, this involves the notion of adding 
the utilities of different individuals, of somehow being abl£ to com- 
pare and weight the, utilities of different individuals. Although 
the marginal utility economists, with the exception of J evens who 
made an interesting slip in connection with the concept of the 
utility of the “ trading body,” knew that it was not necessary to 
make interpersonal comparisons of utility in describing exchange 
under pure competition, they nevertheless did not have the modern 
reticence about making such assumptions. 

Launhardt seems to have been the only economist who at- 
tempted to give rigorous proof of this theorem. As Wicksell has 
pointed out in the section just cited, his argument is mathematically 
and logically false. Yet he must be given credit for having made 
an attempt at rigor, and we can learn more from his unambiguous 
failure than from many pages of fuzzy literary effusion. 

To many modem economists the difficulty with this third line 
of reasoning lies in the fact that it assumes that the utilities of 
different individuals can he compared, in fact added together. 

3 If interpreted literally, he would seem to imply that each and every person is made 
better off by perfect competition, a conclusion which, as Wicksell observes, goes farther 
than the free traders themselves, “for the latter have not denied that a restriction of free 
competition might be most advantageous to a small privileged minority.’' Ibid. , p. 76. 

Actually there is one qualification in Walras’ argument which makes it not so much 
wrong as trivial. He maintains that perfect competition creates a maximum of satis- 
faction, consistent with trading at uniform prices . Waiving the trivial objection that 
under nondiscriminating monopoly trading is also done at uniform prices, I find this 
confusing. Except for positions of multiple equilibria which we may provisionally 
ignore, the equilibrium position under competition is uniquely determined. Instead of 
being the optimum condition under these conditions, it is the only one possible. Thus, 
it is the worst position as well as the best. 
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This they would regard as “unscientific.” But to the preceding 
■ generation of economists, interindividual comparisons of utility 

were made almost without question; to a man like Edgeworth, 

} "i steeped as he was in the Utilitarian tradition, individual utility — 

nay social utility — was as real as his morning jam. And with 
Marshall the apostrophe in consumers’ surplus was always after 
If’ the 5. 

Both Marshall and Wicksell objected to what they considered 
to be a prevalent notion that perfect competition leads to the 
maximum of satisfaction. Both enter as a minor objection the fact 
i j , ' that there may be multiple positions of equilibrium ; actually this 

is largely irrelevant since each stable equilibrium might be a rela- 
tive maximum as compared to points in its immediate neighbor- 
hood ( im kleinen ) even if it were not the maximum maximorum. 
But their major objection consists in the fact that with existing 
distributions of wealth and ability, the processes of imputation 
under competition will give rise to great inequalities in the personal 
distribution of income so that unless individuals are very different 
in their natures the marginal utilities of income will not be equal 
for each individual. Both recognize that in these circumstances 
any interference (a la Robin Hood) with perfect competition which 
transfers income from rich to poor would be beneficial. 

4. It might be thought that at this stage Marshall and Wicksell 
would enunciate a fourth proposition, that exchange under perfect 
competition is optimal provided the distribution of income is 
optimal. In the case of Wicksell the proof which he gives ( Lectures , 
p. 80) to show why perfect competition is not optimal when the 
distribution of income is inappropriate paves the way for a proof 
as to why perfect competition is optimal when the distribution of 
income is appropriate. 4 Wicksell also realizes that when the dis- 
tribution of income is not optimal, the creation of a condition of 

4 Actually Ms proof seems to suffer from one minor drawback. In effect, Ms evalua- 
tion of the change in utility resulting from a change in price from the competitive level 
assumes that in the noncompetitive situation all individuals are still on their offer curves. 
Strictly speaking, this is not possible. It would perhaps be correct to say that his proof 
(with slight modifications) shows that transfer of goods or income from one individual 
to another could not improve the competitive conditions. There is also an unfortunate 
minor slip in expression, perhaps in translation, in the statement that “free competition 
would secure a maximum satisfaction to all parties to the exchange.” {Ibid., p. 81, my 
italics.) Actually, it is the sum of all and not the utility of each that is maximized. 
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imperfect competition may improve the situation, but that this 
is not the best way of improving the situation, since perfect com- 
petition is a necessary condition to “maximize production.” I 
return to this point a little later in this historical review. 

Although inappropriateness in distribution is thought by Mar- 
shall to render the competitive position suspect, he was of the 
belief that many decisions involve alternatives which affect all 
classes more or less equally. He has been criticized for this too 
facile assumption, but it is nevertheless true that many modern 
economists, and this includes some purists, by use of the principle 
> of sufficient reason (or is it insufficient reason?) argue in such terms 
for or against a price change of a commodity which is not presumed 
to relate more to the poor than to the rich. 

However, aside from problems raised by the inappropriateness 
of the distribution of income, Marshall had important objections 
to the equilibrium position realized under perfect competition. 
These objections resulted from his analysis of consumers’ surplus, 
an analysis which was regarded as almost the most significant 
contribution of his Principles. By a comparison of geometrical 
areas he arrived at the conclusion that increasing cost industries 
would be pushed to too great a margin under competition, and that 
the output of decreasing cost industries would be too small under 
competition. From the modern standpoint it is clear that these 
conclusions are true in only a very limited sense. And if Marshall 
did arrive at conclusions which are not completely wrong, it is 
nevertheless clear that he arrived at them for the wrong reasons. 

It is not easy in a few paragraphs to delineate the various faults 
in the Marshallian reasoning. In the first place, his exposition in 
Book V, chapter xiii, is extremely sketchy, and, in the second 
place, it is impossible to avoid the somewhat extraneous difficulties 
arising from the admittedly unsatisfactory treatment of decreasing 
cost by Marshall. However, the latter is simply the most dramatic 
exemplification of the paradox that Marshall, with whom the doc- 
trines of partial equilibrium and industry analysis are inseparably 
associated, nowhere presents a complete or satisfactory theory of 
the industry in its relationship to the firms which make it up. If 
anyone doubts this, he need only compare the treatment of these 
problems by Pigou, Marshall’s shining pupil, in his 1912 Wealth 
and Welfare with his treatment in the late editions of the Economics 
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of Welfare or with that of Viner in the Zeitschrift article cited in 
chapter iii. 

Another inadequacy, but one that can be easily remedied, lies 
in the fact that Marshall neglects producers’ surplus instead of 
treating this symmetrically with consumers’ surplus, so that it is 
possible by the reasoning of page 468, footnote 2, to arrive at the 
curious conclusion that industries of increasing costs should in 
'many cases be contracted even if there are no decreasing cost 
industries to expand. 

There seems to be no point in discussing the Marshallian reason- 
ing at greater length except to note that Pigou nowhere makes 
essential use of the concept of consumers’ surplus in his welfare 
analysis.^ He originally enunciated essentially the Marshallian 
conditions with respect to increasing and decreasing cost indus- 
tries, but as a result of the criticisms of Allyn Young, Knight, and 
Robertson he seriously modified these conclusions. 5 In its final 
form his doctrine holds that the equilibrium of a closed economy 
under competition is correct except where there are technological 
external economies or diseconomies. Under these conditions, since 
each individual’s actions have effects on others which he does not 
take into account in making his decision, there is a prima facie 
case for intervention. But this holds only for technological factors 
(smoke nuisance, etc.) ; changes in factor prices resulting from the 
expansion of demand by firms in an industry represent transfers 
which are irrelevant for determination of ideal output. (In all 
fairness it should be admitted that the correct use of consumers’ 
surplus and producers’ surplus might have helped to avoid error 
in this regard.) 

There would be no reason to rake over these old ashes were it 
not for the fact that Professor Hicks has recently lent the weight 
of his authority to the view that the doctrine of consumers’ surplus 
has a claim to importance in the welfare field. 6 As I have indicated 
in chapter vii, careful perusal of his argument simply confirms my 

5 F. H. Knight, “Fallacies in the Interpretation of Social Cost,” Quarterly Journal 
of Economics (1923). Reprinted in The Ethics of Competition (New York: Harper, 1935), 
pp. 215-236. Allyn Young, “Pigou’s Wealth and Welfare,” Quarterly Journal of 
Economics , XXVII (1913), 672-686. D. H. Robertson, “Those Empty Boxes,” Eco- 
nomic Journal , XXXIV (1924), 16-31. 

6 J. R. Hicks, “The Rehabilitation of Consumers’ Surplus,” Review of Economic 
Studies, VIII (1941), 108-116. 
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belief that the economist — mathematical, literary, beginner, expert 
— had best dispense with consumers' surplus. It is a tool which 
can be used only by one who can get along without its use, and 
not by all such. As Hicks admits, it is not useful in the exposition 
of the conditions of “equilibrium” or “optimum.” And even in 
the case of a Crusoe economy where the problems raised by many 
individuals can be sidestepped, it is usually only devised to give 
the loss in utility resulting from a deviation from the optimum in 
the amount of one good. 

In this connection its principal conclusion states that the 
(second order) change in utility resulting from a deviation in the 
amount of one commodity, other commodities continuing to be 
optimally adjusted, depends upon the amount of the discrepancy 
in that good times the discrepancy in the equilibrium condition. 
This conclusion is no more plausibly derived from consumers' sur- 
plus than from simple intuition. And if one probes deeper, one 
finds in any case that the theorem is incorrect even to the order of 
infinitesimals (second) at which the argument is pitched. 

Thus in the most favorable case to consumers' surplus where one 
commodity, x n +i, has literally constant marginal utility so that 

U = L(x 1 , X$, •••tXn) + tnXn+i, (1) 

and where the goods can be converted into each other at constant 
technological rates as indicated by the relation 

n-f" 1 

X fax* = c, (2) 

1 

for this conclusion to be correct it would be necessary that the 
change in utility resulting from a small change in the amount 
of xi be 

S 2 U = 0 + ILuSxi 2 . ■ (3) 

Actually, by a simple extension of the reasoning in chapter iii, it is 
given by 

S 2 U = 0 + |EE LisSXiSxj. (4) 

1 1 

If we proceed to higher orders of infinitesimals, the case is worse, 
and the same can be said if we drop the unrealistic assumption 
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about the utility of the numeraire, and if the original position is 
not one of equilibrium. 

Even if consumers’ surplus did give a cardinal measure of the 
change in utility from a given change, it is hard to see what use 
this could serve. Only in the contemplation of alternative move- 
ments which begin or end in the same point could this cardinal 
measure have any significance, and then only because it is an indi- 
cator of ordinal preference. Such situations are comparatively 
rare as far as questions of social policy are concerned, being in the 
nature of the somewhat academic question as to whether the intro- 
duction of a little monopoly evil into one industry, all others being 
competitive before and after the change, is better or worse than 
introducljon of some monopoly in another. 

In connection with monopolistic competition the frequent oc- 
currence of decreasing cost and indivisible initial costs inevitably 
raises problems of an “all or none” character. Waiving the diffi- 
culties arising from many individuals, we see that correct decisions 
necessitate reference to ordinal indifference curves and to nothing 
else. Certain difficulties connected with the determination of the 
optimal amount of differentiation of product were properly posed by 
Chamberlin, Cassels, and Kahn without the use of consumers’ 
surplus. 7 

We may conclude from this lengthy digression that after making 
due allowances for external economies and for certain omissions in 
their expositions, the founders of neo-classical economics believed 
that perfect competition led to an optimum in “exchange and 
production” provided that the distribution of income was appro- 
priate. But they did not believe that incomes imputed by the 
competitive process as of a given historical distribution of owner- 
ship of factors of production and personal abilities was in any sense 
the best 'one, and not subject to modification by appropriate 
mechanisms. 

5. Before analyzing the problems encountered under the head- 
ings of optimum “production” and optimum “exchange,” I should 
like to note briefly the existence of economists who attempted to 

7 B. H, Chamberlin, The Theory of Monopolistic Competition (3rd ed*; Cambridge: 
Harvard University Press, 1938), p. 94. j. M. Cassels, “Excess Capacity and Monopo- 
listic Competition,” Quarterly Journal of Economics, LI (1937), 426-443. R, F. Kahn, 
“Some Notes on Ideal Output,” Economic Journal , XLV (1935), 1-35. 
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establish the stronger position that incomes imputed under compe- 
tition were actually right and best. Thus at an earlier date, 
Bastiat, whose powers of analysis were hardly of the highest order 
even in his day, hoped to show that beneficient competition would 
lead to “. . . an amount of utility and enjoyment, always 
greater, and more and more equally distributed. . . 8 

Confronted with the undeniable fact of considerable inequality 
of income and possessing the latent Western European prejudice 
against inequality, writers had either to refer to a future day when 
competition -would achieve better results, or attribute existing in- 
equalities to the admittedly large institutional deviations from 
competition, or to look for inequalities among individuals’ charac- 
teristics (including property ownership) to justify differences in 
income. 

To anyone with knowledge of the world the perverse relation- 
ship between exertion and income made necessary a revision of the 
classical real cost doctrine in its simplest form, although the promo- 
tion by Senior of abstinence to the rank of a full-bodied real cost 
helped to bolster that doctrine. But ultimately refuge was found 
in the undeniable fact of differences in personal “ability” and the 
related doctrine of non-competing groups. This raised many ques- 
tions as to the extent to which the relevant abilities were or were 
not “acquired characteristics” and the degree of correspondence 
between the distribution of abilities and income. That much of 
this discussion was meaningless and from most points of view ir- 
relevant does not detract from its significance from the standpoint 
of the history of ideas. 

Among analytical economists J. B. Clark 9 is best known for 
his belief that not only will factors of production have imputed to 
them their marginal productivity under competition, but that this 
is a “natural law” which is “morally justifiable” since this is their 
“actual,” “specific” product. Indeed Clark himself considered 
that the principal way that his independently discovered marginal 
productivity doctrine represented an improvement upon Von 
Thiinen lay in his demonstration of its ethical fairness as compared 
to the latter’s belief that the doctrine involved exploitation* That 
Clark, who clearly states the distinction between personal and 

8 F. Bastiat, Harmonies of Political Economy (2nd ed.; Edinburgh, 1880), p. 301. 

9 J. B. Clark, The Distribution of Wealth (New York, 1899). 
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functional income, should have thought that he had proved the 
ethical fairness of income determination under competition is 
simply a reflection of the fact that where emotional beliefs in right 
and wrong enter into analysis, it is usually not to the advantage of 
the latter. As we shall see, even if all income resulted from per- 
sonal services, Clark’s proposition is not consistent with widely 
held ethical views ; and if it is accepted as a definition rather than 
a theorem, it will be found to be consistent with no unambiguous 
ethical evaluation of different individuals’ welfare. Nevertheless, 
it has considerable appeal, especially in a frontier society, where 
each individual could be thought of as working by himself under 
conditions where “his” product could be identified. Analytically, 
it is almost precisely in these terms that Clark first perceived his 
doctrine, going with painful slowness from the (broad) “zone of 
indifference” to the concept of the internal margin. 

6. While Wicksell and Marshall held that competition would be 
optimal if the distribution of income were appropriate, it was left 
for Pareto 10 to take the stronger position that competition pro- 
duces a maximum d’utilite collective regardless of the distribution of 
income, and indeed even if the utilities of different individuals were 
not considered to be comparable. An optimum position in this 
sense was defined by the requirement that there should not exist any 
possible variation or movement which would make everybody better off. 

His discussion is not easy to follow, and it has not received 
attention from economists commensurate with the importance 
which he attached to it. Yet it forms the basis of many modern 
notions, and it led directly to the important contribution of Barone. 
Pareto also seems to have been one of the first to discuss criteria 
of planning under collectivism. 

Pareto’s exposition is complicated by the fact that he works 
with differentials or first order (infinitesimal) variations. This was 
a very common practice with mathematicians and physicists of 
the nineteenth century, and because of its formal heuristic con- 
veniences is still often used today. And under proper qualifica- 
tions this practice can be given a rigorous, unambiguous basis. 
Nevertheless, where delicate problems of interpretation are in- 
volved it often obscures more than it reveals, especially if the prob- 

10 V. Pareto, Manuel D'Economie Politique (1909), chap, vi; also the Mathematical 
Appendix, par. 89, passim. 
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lem arises as to whether any given differential expression is an 
“exact” differential 

Pareto is unwilling to add the utility or ophelimite 9 of different 
individuals together, either in toio , i.e. (U 1 + U 2 + * * •)> or f° r 
small variations (dll 1 + dll 2 + * * *)• For that would involve a 
comparison of different individuals’ utility, and in addition it would 
depend upon the particular cardinal index of ophelimite ’ selected for 
each. But he was interested in comparing the summed variation 
in the utility of each, after these expressions have been divided by 
the marginal utility of any one good, a, selected as numeraire. 
For if we examine the dimensionality of the expression 

JflW + ~8U*+ •••, ( 5 ) 

where as usual subscripts represent partial differentiation, but 
where the superscript is taken to indicate different individuals, we 
shall see that this has the dimensionality of the good, a , and 
nothing else. 

Pareto attempts to show that if the original position is one of 
equilibrium under perfect competition, then no possible variations, 
consistent with the fundamental scarcity of goods and given tech- 
nology, can make the above expression positive. If it could, he 
says, it would be possible to arrange things so that each term in 
the expression could be made positive, and then everyone would 
be better off. But the expression cannot be made positive. Ac- 
tually, regarded as a differential expression (of the first order), the 
above expression can be shown to be zero, in consequence of the 
fact that each commodity is sold at minimum unit cost (propor- 
tionality of marginal products, etc., 11 and in consequence of the 
tangency of each individual’s indifference curves to mutual price 
exchange loci. If recourse is had to differentials of higher order, 
the secondary maximum-minimum conditions of firms and indi- 
viduals will guarantee that the expression (5) will be negative for 
all finite deviations from the competitive position, or so Pareto 
attempted to show. 

Although Pareto’s treatment is somewhat sketchy and in need 
of expansion, part of which Barone later provided, the main out- 
lines are reasonably clear. But in connection with the funda- 

11 Ibid*, p. 646. 
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mentals of interpretation of the significance of his maximum, there 
do arise certain problems. First, can the differential expression of 
(5) be regarded as the exact differential of some expression? Ac- 
tually, Pareto later gives a name to this expression, calling it 8 U; 
but is there an expression U (social utility?) of which this is an 
exact differential? Pareto does not tell us, but presumably he 
would answer, no, if he were on guard when the question was asked. 
As we shall see, Barone does work with an expression whose differ- 
ential corresponds to (5), but he clearly recognizes that it is a 
construction, not involving the dimension of utility, but rather 
that of the numeraire good. 

But the most important objection to Pareto’s exposition is his 
lack of emphasis upon the fact that an optimum point, in his sense, 
is not a unique point. 12 If transfers of income from one individual 
to another are arbitrarily imposed, there will be a new optimum 
point, and there is absolutely no way of deciding whether the new 
point is better or worse than the old. His optimum points consti- 
tute a manifold infinity of values. This locus can be obtained 
under regimes quite different from perfect competition (e.g., by 
multilateral monopoly). Within Pareto’s system it is impossible 
to decide, by his differential criterion or otherwise, which of two 
points on what may be called the “generalized contract locus” are 
better, or even that a given movement off the contract locus and 
hence to a non-optimal point is good or bad. Actually in terms 
of the wider reference schemes of ordinary economic thought, such 
a movement may be deemed eminently desirable. But Pareto 
shows that however desirable such a move may be, there exists still 
a better move, which for the same (ordinal) amount of harm to 
those who “should” be harmed, will yield more benefit for the 
worthy ones who are to be benefited. This is an important 
contribution. 

7. In a masterly article, written in 1908 in Italian, but not 
translated into English until 1935, 18 Barone developed further and 
in greater detail the Paretian conditions of optimum, especially as 

12 In his earlier Cours discussion, II, 90 fL, he explicitly assumes the distribution of 
income to be “ convenable ,” but in his later Manuel treatment the dependence of the 
optimum point on the initial distribution of income, and hence its lack of uniqueness, 
is not brought to the foreground. 

13 Reprinted as an Appendix in F. A. Hayek, ed., Collectivist Economic Planning 
(London: Routledge, 1935), pp. 245-290. 
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they relate to the planning of production under collectivism. By 
avoiding all mention of utility and indeed without introducing even 
the notion of indifference curves, Barone was able to break new 
ground along lines which have in recent years become associated 
with the economic theory of index numbers. 

Unlike most of the writers discussed above, Barone is unsatisfied 
with the statement that free competition maximizes product, or 
sums of product, which can then be distributed in any given 
fashion. Heterogeneous products cannot be added. Further- 
more, leisure may be preferred to the maximization of output. It 
is significant that those writers who do not explicitly introduce the 
equations of general equilibrium should gloss over the definition of 
“product” which is allegedly maximized. Thus, Wicksgll 14 con- 
fines his demonstration to a case where the same product can be 
provided by different sources, and only in this case shows that 
various marginal conditions are optimal. The same is true of the 
very excellent treatment by Knight, in which the movement of 
goods over alternative roads is analyzed for optimum conditions. 15 

It is remarkable that Professor Pigou, who reaches substantially 
correct conclusions, never squarely meets the problem of the defini- 
tion of social product. His index number discussion represents an 
important contribution in its own right, but it is offered at best as 
an approximate criterion or indicator of changes in individual and 
social welfare. He would not seriously suggest that the thing to 
be maximized is the money value of output deflated by an ideal 
index of prices. Nor will the more exact limits of index number 
theory such as are discussed in chapter vi avail. 

Barone proposes to add different products after they have been 
weighted by their respective prices; it is usually taken to be con- 
venient to express these prices as ratios to the numeraire good, a. 
For Barone, productive services can be treated simply as alge- 
braically negative goods and services. Thus, decisions between 
more or less work can be included in his welfare system. Then if 
the sum total of each good consumed by all individuals together is 
written as 

A = a 1 + a? + 

B = b 1 + 6 s + -. • • 
etc., 

14 Wickseil, op. ciL, p. 140, passim. 

15 Knight, op. cit. f p. 219. 
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and recalling that 

6U* = U.W 4- WSb* + •••, (7) 

we can write Pareto’s equation (5) above in the equivalent form 

l(So’ + So 2 + • • •) + Pb(Sb l + hb* + • • •) + • • • 

— 5 A. d* pb&B -{-•■•. (8) 

In passing from (4) to (8) , use is made of the fact that the ratios of 
marginal utilities of two goods for each individual are equal to their 
price ratios. Barone himself does not use this terminology, but 
no doubt he would have to if the connection with Pareto were to 
be shown. 

The expression in (8) can be regarded as the variation in the 
following expression when prices are regarded as constants: 

4> = A +p b B + • • •. (9) 

If productive services were known to be constant so that they could 
be neglected, (9) would equal (except for dynamic factors involving 
capital which we can ignore) money value of national product. If 
all productive factors are included, it will represent the net differ- 
ence between value of consumers’ goods and the return to produc- 
tive services. Under many assumptions this quantity must be 
zero under the full conditions of perfect competition. 

Barone shows that perfect competition maximizes this expres- 
sion, prices being taken as fixed parameters, i.e., any variation from 
a condition of price equal to minimum cost must make 54* as given 
in (8) negative. Thus, if we are at conditions other than perfect 
competition, with (8) not equal to zero for all possible variations, 
it will be possible to specify a movement which will make 5$ posi- 
tive. But we can think of 5$> as being made up of the sum of a 
similar expression referring to each individual 

5$ = Sip 1 -f- 8<p* +•'•• = (So 1 -T phbb 1 -j- • • •) 

+ (So 2 + p b 8b 2 +•••)+ (10) 

If for any movement, the total 5$ is positive, it is not necessary 
that each and every one of the individual’s 8<p be positive; but it 
is necessary that those which are positive should outweight those 
which are negative. Thus, those who are hurt could be compen- 
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sated by those who are helped, and there would still be a net gain 
left to be parceled out among the individuals. 

This is essentially the gist of the Barone argument. The one 
point which will occur to the critical reader is the fact that arbi- 
trary prices are assumed in evaluating the expression to be maxim- 
ized. Which prices are to be used? Barone employs the prices 
which prevailed before a contemplated break with the conditions 
of competition are made, and this suffices if one merely wishes to 
demonstrate that not all individuals can be improved by any 
departure from competition. 16 

Unlike Pareto, Barone satisfies himself with deriving optimal 
production conditions without going into the fact that under compe- 
tition no additional individual exchanges of fixed amounts,of goods 
would be mutually profitable. No doubt this oversight resulted 
from his wish to avoid the use of indifference curves and utility, 
but even without these constructions, by the use of the index num- 
ber notions which he pioneered, the enlarged conditions of exchange 
could have been included. It is a tribute to this work that a third 
of a century after it was written there is no better statement of the 
problem in the English language to which the attention of students 
may be turned. 

8. The next writer who deserves our attention is A. P. Lerner, 
who comparatively recently developed, presumably independently, 
the Paretian conditions which show that the marginal equivalences 
realized by perfect competition lead to an optimum of production 
and exchange in the special senses discussed above. 17 Actually, in 
the field of production his statement of the problem is slightly 
different from that of Pareto and Barone. They showed that a 
movement to conditions of perfect competition in the field of pro- 
duction and cost could make everyone better off because they could 
be given more of every good. But they still worked with indi- 
viduals. Even in a collectivized state where the existence of the 
individual is not assumed, the Lerner formulation of the sense in 
which output is optimal would still hold : the marginal equivalences 
of competition are such as to give a maximum of any one product for 

18 Actually, Barone discusses varying prices in a passage which seems obscure to me# 
Op. cit . , p. 255. 

17 A. P. Lerner, “The Concept of Monopoly and the Measure of Monopoly Power/ 1 
Review of Economic Studies , I (1934), 157-175. “Economic Theory and Socialist 
Economy,” Review of Economic Studies , II (1934), 51-61. 
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given specified amounts of all others. This is almost identical with 
the Pareto-Barone production propositions, but not quite. 

Professor Hotelling, also presumably independently, developed 
in two articles 18 conditions closely related to the Pareto production 
and exchange conditions of optimum. In particular, he has in- 
sisted upon the fact that marginal rather than average costs provide 
the appropriate basis for pricing, and he developed the dramatic 
applications of this to the problem of railroad rates and decreasing 
cost public utilities in general. On the analytic side each thing 
which he sets out to prove he does prove with great elegance and 
generality, but his fundamental primitive assumptions are only 
implicitly related to each other and to the equations of general 
equilibrium. Moreover, his welfare work really falls into two dis- 
tinct headings; on the one hand, that of the first article cited and 
much of the second article, and on the other hand, that of the 
second section of his second article referring to the “fundamental 
theorem” (especially pp. 248-256). Roughly these two diverse 
contributions of Hotelling fall respectively under the headings of 
optimal production and optimal exchange conditions; or, on the 
analytic side, to the difference between firms with unlimited 
budgets and the consumer with limited budget, to each of which 
fields Professor Hotelling has contributed much in the way of 
demand analysis. This dualism explains why so discerning a 
reader as Professor Frisch should have been puzzled by the 
Hotelling proof. 19 

18 H. Hotelling, “The General Welfare in Relation to Problems of Taxation and of 
Railway and Utility Rates,” Econometrica , VI (1938), 242-269, “Edgeworth’s Taxation 
Paradox and the Nature of Demand and Supply Functions,” Journal of Political Econ- 
omy , XL (1932), 577-616. 

19 Space cannot permit a detailed examination of the exact steps in the Hotelling 
reasoning, this being particularly unnecessary since it is clear that his conclusions are 
impeccable. In the original specification of his system Professor Hotelling essentially 
generalizes the Dupuit-Marshall partial equilibrium set-up to many interrelated indus- 
tries. However, unless we confine ourselves to the production problem alone, this will 
not lead to the equations of general equilibrium. These require the addition of the 
special demand functions of consumers for goods and their supply functions of productive 
services. In the mixed consumer-firm system the integrability conditions which give 
meaning to Hotelling’s line integral, dead loss, and price potential (equal to the Barone 

function) are not satisfied. Nor when it came to interpretation would it matter for 
the validity of the Pareto-Barone-Lerner conditions if they were. While Hotelling 
gives separate consideration to consumers when discussing excises, the two treatments 
are never adequately integrated. 
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9. The last writer to be mentioned Is Professor A. Bergson, 20 
He Is the first who understands the contributions of all previous 
contributors, and who Is able to form a synthesis of them. In 
addition, he is the first to develop explicitly the notion of an ordinal 
social welfare function in terms of which all the various schools of 
thought can be interpreted, and in terms of which they for the first 
time assume significance. In view of his own very generous ac- 
knowledgments of the work of others, even where he himself had 
independently rediscovered many basic theorems in an improved 
form, it is regrettable that his contribution has received so little 
notice. No doubt this stems in part from the mathematical 
character of his exposition, and to the fact that he uses the rather 
difficult notation of differentials throughout. The analysis that 
follows is simply an enlargement and development of his impor- 
tant work. 21 


The Social Welfare Function 

It is fashionable for the modern economist to insist that ethical 
value judgments have no place in scientific analysis. Professor 
Robbins in particular has Insisted upon this point, 22 and today It is 
customary to make a distinction between the pure analysis of 

20 A. Bergson, “A Reformulation of Certain Aspects of Welfare Economics,” 
Quarterly Journal of Economics , LI I (1938), 310-334. 

21 In recent years Kaldor and Hicks have given an exposition of certain aspects of 
welfare economics. N. Kaldor, “Welfare Propositions in Economics, ” Economic Journal , 
XLIX (1939), 549-552. J. R. Hicks, “Foundations of Welfare Economics,” Economic 
Journal , XLIX (1939), 696-712. Mention should also be made of an important article 
which indicates the modification in the Pigouvian analysis necessitated by the considera- 
tions of monopolistic competition. R. F. Kahn, “Some Notes on Ideal Output,” 
Economic Journal , XLV (1935), 1-35. A convenient compact summary of welfare 
economics is provided by O. Lange, “The Foundations of Welfare Economics,” Eco- 
nometrica , X (1942), 215-228. An advance in the discussion is represented by T. 
Scitovsky, “A Note on Welfare Propositions in Economics,” Review of Economic Studies , 
IX (1941), 77-88. Because discussions of Free Trade illuminate the beliefs of economists 
upon these matters, and because this subject provides a convenient illustration, it 
would be desirable to review its literature. However, reference can only be made here 
to the survey in J. Viner, Studies in the Theory of International Trade (New York: Harper, 
1937); to T. Scitovsky, “A Reconsideration of the Theory of Tariffs,” Review of Eco- 
nomic Studies , IX (1942), 89-110; P. A. Samuelson, “Welfare Economics and Inter- 
national Trade,” American Economic Review, XXVIII (1938), 261-266; “The Gains from 
International Trade,” Canadian Journal of Economics and Political Science , V (1939), 
195-205. 

22 L. Robbins, An Essay on the Nature and Significance of Economic Science (London, 
1932). 
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Robbins qua economist and his propaganda, condemnations, and 
policy recommendations qua citizen. In practice, if pushed to ex- 
tremes, this somewhat schizophrenic rule becomes difficult to ad- 
here to, and it leads to rather tedious circumlocutions. But in 
essence Robbins is undoubtedly correct. Wishful thinking is a 
powerful deterrent of good analysis and description, and ethical 
conclusions cannot be derived in the same way that scientific 
hypotheses are inferred or verified. 

But it is not valid to conclude from this that there is no room 
in economics for what goes under the name of “welfare economics/ ’ 
It is a legitimate exercise of economic analysis to examine the con- 
sequences of various value judgments, whether or not they are 
shared by the theorist, just as the study of comparative ethics Is 
itself a science like any other branch of anthropology. If it is 
appropriate for the economist to analyze the way Robinson Crusoe 
directs production so as to maximize his (curious) preferences, the 
economist does not thereby commit himself to those tastes or in- 
quire concerning the manner in which they were or ought to have 
been formed. No more does the astronomer, who enunciates the 
principle that the paths of planets are such as to minimize certain 
integrals, care whether or not these should be minimized; neither 
for all we know do the stars care. Our above historical review 
should show that there is meaty and weighty content to the field 
of welfare economics, without invoking new methods in economic 
thought. In saying this, I do not mean to imply that the field of 
welfare economics has scientific content because a number of its 
theorems do not require inter-personal comparisons of utility; this 
after all is a mere detail. That part which does involve inter- 
personal comparisons of utility also has real content and Interest 
for the scientific analyst, even though the scientist does not con- 
sider it any part of his task to deduce or verify (except on the 
anthropological level) the value judgments whose implications he 
grinds out. In the same way, the mathematical theory of prob- 
ability accepts as a primitive undefined assumption whose validity 
is not its concern the initial specification of “equally likely” events, 
the measure of various “ ‘classes” or the “collective,” and it then 
proceeds to grind out the mathematical implications of these and 
subsidiary hypotheses. It is only fair to point out, however, that 
the theorems enunciated under the heading of welfare economics 
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are not meaningful propositions or hypotheses in the technical 
sense. For they represent the deductive implications of assump- 
tions which are not themselves meaningful refutable hypotheses 
about reality. 

Without inquiring into its origins, we take as a starting point 
for our discussion a function of all the economic magnitudes of a 
system which is supposed to characterize some ethical belief — that 
of a benevolent despot, or a complete egotist, or “all men of good 
will,” a misanthrope, the state, race, or group mind, God, etc. Any 
possible opinion is admissible, including my own, although it is best 
in the first instance, in view of human frailty where one’s own 
beliefs are involved, to omit the latter. We only require that the 
belief be such as to admit of an unequivocal answer as tc whether 
one configuration of the economic system is “better” or “worse" 
than any other or “indifferent,” and that these relationships arc 
transitive; i.e., A better than B, B better than C, implies A better 
than C, etc. The function need only be ordinally defined, and it 
may or may not be convenient to work with (any) one cardinal 
index or indicator. There is no need to assume any particular 
curvature of the loci (in hyper-space) of indifference of this func- 
tion. Utilizing one out of an infinity of possible indicators or 
cardinal indices, we may write this function in the form 

W = W(z u s 2 , ■■■), (11) 

where the s’s represent all possible variables, many of them non- 
economic in character. 

Between these z’s there will be a number of “technological" 
relations limiting our freedom to vary the s’s independently. Just 
what the content of these technological relations will be depends 
upon the level of abstraction at which the specifier of the value 
judgments wishes to work. If he is an out and out Utopian he may 
wish to ignore various institutional relations regardless of their 
empirical importance; indeed, he may go all out and repeal the laws 
of conservation of energy and widen greatly the technological 
productivities of the system. On the other hand, he may wish to 
take as fixed and immutable all social and economic institutions 
except those relating to the Central Bank. (Indeed those of fatal- 
istic temperament may regard the restraints to be so numerous as 
to leave no problem of choice.) In other words, the auxiliary 
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constraints imposed upon the variables are not themselves the 
proper subject matter of welfare economics but must be taken as 
givens. 

Subject to these constraints, which may be written in the most 
general way as 

g*(si, z 2 , • • •) = 0, (12) 

there will presumably be an upper bound to W (even though no 
unique value of the s’s need correspond to this maximum level). 
If certain assumptions of regularity are made, it would be possible 
to indicate formal conditions for the maximum, involving La- 
grangean multipliers, matrices, rank, and quadratic forms definite 
under constraint, etc. However, there is no particular point in 
developing this formalism. 

The subject could end with these banalities were it not for the 
fact that numerous individuals find it of interest to specialize 
the form of W, the nature of the variables, s, and the nature of 
the constraints. 

(1) For one thing, prices are not usually included in the welfare 
function itself, except very indirectly through the effects of differ- 
ent prices and wages upon the quantities of consumption, work, etc. 

(2) Also certain of the variables can be thought of as referring 
to a particular individual or family; e.g., one of the z’s may be the 
amount of tea consumed by John Jones, or the amount of unskilled 
labor which he provides. 

(3) It is often further assumed that the quantities of a given 
commodity consumed by one individual are of the same type as 
those consumed by another; technically, this means that certain 
of the variables enter in the technical side-conditions in sums, which 
relate the total amount produced of a commodity to input, regard- 
less of the ultimate distribution of that output. Whatever may be 
said about the admissibility of this, the matter is still worse when 
a similar assumption is made concerning the homogeneity of the 
various services provided by different individuals. However, even 
if in the most rigorous sense each individual’s talents are unique, 
society rarely has the time or patience to learn to appreciate the 
flavor of each man, and, in the absence of perfect “screening” of 
different individuals, it does treat them as if they were perfectly 
substitutable; thus for our purposes they may be assumed to be so 
in many cases. This does not mean that we work with a single 
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grade of labor; on the contrary, the number of grades may be very 
large and the classification minute, but it is assumed that there 
are many individuals in each grade, actually or potentially. 

(4) Not infrequently it is assumed that a given grade of pro- 
ductive sendee may be used indifferently in a number of uses. 
Technically this means that certain of the Ts enter into the welfare 
function only as certain sums. As Bergson has pointed out, this 
involves implicit value judgments, so that Robbins when he dis- 
cusses the problem of allocating resources as between alternative 
uses so as to maximize (in some sense) output or personal utility 
is not able even on the Robinson Crusoe level to avoid these no- 
tions; or rather, if resources are not assumed indifferent between 
at least two uses, few interesting marginal conditions can be 
deduced. 

(5) A more extreme assumption, which stems from the indi- 
vidualist philosophy of modern Western Civilization, states that 
individuals' preferences are to “count.” If any movement leaves 
an individual on the same indifference curve, then the social welfare 
function is unchanged, and similarly for an increase or decrease. 
Actually, an examination of the principles of jurisprudence, the 
folkways and mores, shows that in its extreme form this assumption 
is rarely seriously proposed. Even “sane” adults are not permitted 
to eat and drink what they think best, individuals cannot sell 
themselves in order to consume more in the present, milk ration 
tickets cannot be exchanged for beer at the will of the owner, etc. 

But economists in the orthodox tradition have tended to con- 
sider the above cases as exceptions. 23 However, in recent years 
many economists, Frank Knight being a notable example, have 

23 Consider, however, the following interesting quotation from Edwin Cannan who 
was very much in the classical tradition; “We shall never decide whether to put a penny 
on beer or to further steepen the super-tax on incomes by considering how much the 
loss of a penny pinches the beer-drinker and the duke; we shall, and we do, decide it 
by making some rough estimate of the aggregate advantage in the long run of the two 
methods to society at large. For example, if we find that cheaper beer means better 
food for underfed children while less super- tax means more training of horses to ran fast 
for a short distance with a very light burden, we incline to the super- tax: but if we End 
cheaper beer means more beer for drunkards and less super-tax means more houses for 
the people to inhabit in comfort and health, we incline to the beer tax.” From an 
Economic Journal review of Sir Josiah Stamp’s Fundamental Principles of Taxation in 
the Light of Modern Developments , reprinted in An Economist's Protest (London, 1927), 
p. 279. 
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insisted upon the degree to which individual tastes and wants are 
socially conditioned by advertising and custom so that they can 
hardly be said to belong to him in any ultimate sense. All this is 
recognized in the witticism of the soap box speaker who said to the 
recalcitrant listener, “When the revolution comes, you will eat 
strawberries and cream, and like it!” Attention should also be 
called to the fact that even the classical economist does not literally 
have the individual in mind, so much as the family; of course, some 
hardy souls will pursue the will-o’-the-wisp of sovereignty within 
the family so as to reduce even these collective indifference curves 
to an individualistic basis. 

(6) One does not have to be a John Donne then to find fault 
with the* above assumption, especially if we consider the closely 
related assumption that an individual’s preference depends only 
upon the things which he consumes and not upon what others 
consume. As Veblen characteristically pointed out, much of the 
motivation for consumption is related to the fact that others do or 
do not have the same thing. Conspicuous expenditure, “keeping 
up with the Joneses,” snob appeal, maintenance of face, are im- 
portant in any realistic appraisal of consumption habits ; and if we 
turn to the field of power analysis, it is not only on the national 
scale that “satisfactions” are relative|and dog-in-the-manger tactics 
are rational. 

If this sixth assumption is not made, many of the conclusions 
of welfare economics will remain valid, but they will require modi- 
fications to allow for certain “external” consumption economies 
not dissimilar analytically to the external technological economies 
and diseconomies of the Marshall-Pigou type. 

(7) All of the above assumptions are more or less tacitly ac- 
cepted by extremely divergent schools of thought. The next as- 
sumption involves a more controversial value judgment, but one 
which has been characteristic of much of modem thought of the 
last century, and which is especially typical of the beliefs of the 
classical and neo-classical economists. It is that the welfare function 
is completely (or very nearly) symmetrical with respect to the con- 
sumption of all individuals. 

Taken in connection with previous assumptions, in its strict 
form it is not consistent with the patent fact of considerable differ- 
ences in individuals’ overt preference patterns. Thus, in addition 
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to involving a very significant value judgment, it also involves a 
very definite assumption of fact. This was not appreciated by 
economists, who tended to believe in the desirability of an equality 
of income, leaving it to the individual to determine the exact form 
of his consumption. However, it is easy to show that the rule of 
equality of income (measured in dollars, numeraire, abstract pur- 
chasing power) applied to individuals of different tastes, but made 
to hold in all circumstances, is actually inconsistent with any de- 
terminate, definite W function. Equality becomes a fetish or 
shibboleth, albeit a useful one, in that the means becomes the end, 
and the letter of the law takes precedence over the spirit. 

For the decision of equal incomes as optimal in one situation 
implies a certain relative well-being as between vegetarians and 
non -vegetarians ; at different relative prices between vegetables 
and non-vegetables an equal distribution of income can no longer 
be optimal. Actually this does not render invalid the reasoning 
based upon this seventh premise since the adherents of this view 
implicitly held that individuals were very much alike, and if given 
equality of treatment would develop the same want patterns. 
Moreover, they could in all logic take the milder position that a 
great deal less inequality than existed in real life would be desirable 
even if one did not believe in complete equality. 24 

In a similar way, the belief that the individual should rightfully 
receive his imputed productivities is not consistent with a W func- 
tion having properties (1) through (6). A change in the techno- 
logical situation will alter individuals’ fortunes so that the final result 
cannot be optimal if the initial situation was deemed so. Perhaps 
the bourgeois penchant for laissez-faire is the only case on record 
where a substantial number of individuals have made idols of 
partial derivatives, i.e., imputed marginal productivities. 

One could similarly multiply expressions and beliefs which are 
part of everyday parlance, which upon examination turn out to be 
inconsistent and meaningless. The slogan, “the greatest good for 
the greatest number,” was shown by Edgeworth to be one such; 

24 It is not tlie purport of the above lines that the use of a welfare function leads to 
a belief in inequality as compared to equality. It merely shows that equality of money 
income where there is diversity of tastes involves the equality of nothing important. 
It is in lesser degree like the Ana tale France aphorism concerning the equality of the 
law in its treatment of rich and poor. Before Bergson's treatment one could have 
sensed, but not analyzed definitively, this subtlety. 
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and we might add the dictum, “each to count for one and only one.” 
As Professor Knight has ceaselessly insisted, Western Man is a 
hodgepodge of beliefs stemming from diverse and inconsistent 
sources. Fortunately his life is sufficiently compartmentalized so 
that he can assume his various roles with a tolerable amount of 
ambiguity in each; and only the most introspective worry about 
this enough to become disorganized. 

(8) A final essentially unnecessary assumption, which was espe- 
cially characteristic of the last generation of economists, was the 
definition of the welfare function, which was to be maximized, as the 
sum of cardinal utilities experienced by each individual. Before the 
time of Bergson this was not uncommonly met even in the advanced 
literature, 1 'and vestiges can be found today. It stemmed from the 
main Utilitarian stream of economic thought, when utility was used 
interchangeably in a behavioristic, in a psychological, in a physio- 
logical, in an ethical sense. 

It was not uncommon for older writers to ponder over the 
question as to whether utility was being maximized or whether 
pain was being minimized ; whether by and large man was operating 
“in the red” below absolute zero, but making the best of a bad lot. 
The answer depended upon the author’s theology and endocrine 
glands at the moment. Paley, Sidgwick, and others could seri- 
ously ask whether it was better to have a tremendous population, 
each contributing a little to a vast amount of Social Utility, or 
whether it was better to have less Social Utility, provided its 
average amount per head was maximized. 

In the field of Public Finance the assumption of additive indi- 
vidual utilities plus the law of diminishing utility were used to 
justify the imposition of progressive taxes. In its most sweeping 
form this doctrine set up minimum aggregate sacrifice or maximum 
total utility as appropriate goals of action. This goal can be ob- 
tained only if the marginal utility of income (after taxes) is equal 
for all individuals; or if individuals are essentially alike, only with 
equal incomes for all. 25 On the other hand, the criterion that a 
given sum of taxes should be raised so as to lead to equal sacrifice 
for all is a much more conservative doctrine; following it we can 

“F. Y. Edgeworth, “The Pure Theory of Taxation,” Economic Journal, VII 
( 1897 ), 550 - 571 . 
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only be sure that taxes should increase with income, but not 
necessarily in proportion with income . 26 

Today such arguments are not very fashionable since it is as 
easy to assume one’s conclusions with respect to appropriate policy 
as to assume the premises of these arguments. Not only is the 
former more direct, but it is more honest. Nevertheless, some of 
the considerations which enter into the above arguments are latent 
in much of modern discussion and thought. In the distribution 
of war burden the moderately well-to-do tend to point to their 
rather large sacrifices and to call upon the lower classes to share in 
these and make new ones. The relatively poorer farmers and 
laborers concentrate upon how much the rich have left after they 
have made substantial sacrifices, and how little they themselves 
have in any case, not in comparison with what they had before the 
war, but compared to what they consider fair. Whether war 
times are the appropriate times to redress ancient wrongs or not, 
it is too much to expect that the bargaining advantages which the 
war brings will not be used for this purpose. 

In connection with this eighth assumption it was implicitly 
assumed that real income could be treated as a homogeneous quan- 
tity to be distributed among individuals. This could be literally 
true only in a one-commodity world, or in a world where all relative 
prices were fixed constants. Actually prices will vary depending 
how money income is distributed. Strictly speaking, therefore, 
the real judgments embodied in the welfare function must be judg- 
ments concerning a multitude of diverse goods. This would be 

26 The condition of equality of sacrifice is satisfied at each level of income if 
U{X) - U(X - t) » constant for all X. 

Differentiating this so as to determine the explicit change in t with respect to X t 

- u '( x ~ ~ E$£2 

dX “ U'(X - i) 

Because of diminishing marginal utility this is positive. But if we wish to have pro- 
gressive taxation, the elasticity of income after taxes against income before taxes must be 
less than one. But 

X d(X-t) XU r (X) 

X-t dX (X - t)U\X - t) ’ 

which is less than one if, and only if, the elasticity of the marginal utility curve is less 
than unity. Thus, for Bernoulli’s law of utility, equality of sacrifice would imply pro- 
portional rather than progressive taxation. 
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quite a problem even for a man with definite opinions and great 
preoccupation with value judgments. If, however, he takes refuge 
in assumption (5) that individual preferences are to count, the 
individual can be left to decide for himself how he will spend his 
money at given prices. Our ethical observer need only decide 
then what his preferences are as between the given levels of satis- 
faction of different individuals. 

It might be thought that our ethical observer, even if the indi- 
viduals themselves had no unique cardinal indexes of utility, would 
have to find cardinal indicators. But this would be quite incorrect. 
Of course, if utilities are to be added, one would have to catch hold 
of them first, but there is no need to add utilities. 27 The cardinal 
utilities qnter into the W function as independent variables if as- 
sumption (5) is made. But the W function is itself only ordinally 
determinable so that there are an infinity of equally good indicators 
of it which can be used. Thus, if one of these is written as 

W = F{U l , U 2 , • • •)> (13) 

and if we were to change from one set of cardinal indexes of in- 
dividual utility to another set (F 1 , V 2 , ■ • ■ ) , we should simply 
change the form of the function F so as to leave all social decisions 
invariant. Thus, let us move from one configuration of goods 
going to the different individuals to another configuration which 
leaves W unchanged or which is socially indifferent. Then no 
redefinition of the U’s or of F can change this fact: the social in- 
difference loci are independent of cardinal numbering. And in this 
terminology the significance of assumption (5) that individual 
tastes shall count is contained in the requirement that the social 
indifference slopes between two goods going to the same individual 
are exactly the same as the individual’s indifference ratios. As- 
sumption (6) adds that these curves are unaffected by changes in 
the goods going to other individuals. 

27 Even if one wished to add utilities, it would seem silly from any ethical viewpoint 
to have one’s opinions as to correct taxation influenced by how consumers spend their 
income on sugar. Yet that is what the recent attempts to measure marginal utility 
must imply if they have any pretensions to relevance for policy. R. Frisch, New Methods 
of Measuring Marginal Utility (Tubingen, 1932). I. Fisher, “A Statistical Method for 
Measuring ‘Marginal Utility’ and Testing the Justice of a Progressive Income Tax,” in 
Economic Essays Contributed in Honor of John Bates Clark (New York, 1927). 
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Mathematical Analysis 

The assumptions discussed above can be given mathematical 
formulation. Thus, assumptions (2) and (3) imply that the impor- 
tant variables of our system can be regarded as a number of com- 
modities and productive sendees (X u X«, ■ • X„, V h V«, ■ • V m ). 

These totals can be distributed among 5 individuals of the system; 
the amounts going to each can be written with a superscript which 
identifies the individual and a subscript identifying the commodity 
or service. (While productive sendees could be written as negative 
commodities, I have chosen to conform to the more common pro- 
cedure encountered in the literature.) 

Xi = £ x/, (i = 1, • • • , ri) 

(14) 

Vj = £ Vj k . (j = 1, • • •, m) 

The social welfare function involves only the amounts going to 
each individual, not prices or totals. Therefore, equation (11) 
is specialized to 

W- = W(x x 1 , • • • , Xn 1 ; ••• ; xY, 

01 1 , • • • , Vm 1 ; • • • ; Vl\ • • • , (IS) 

This exhausts the implications of the first four assumptions. 

Assumptions (5) and (6) further specialize (11) so that the 
welfare function can be written as 

W = • • • , x^; Vi 1 , ■ ■■, vj), • • • , 

U*(xi’, •••,*»*; £'i s , • - • , 0™*)], (16) 

where the cardinal forms of the U's and the W are arbitrary. 
Assumption (7) is meaningless unless the respective U's can be 
made identical; but if this can be done, W must then be a sym- 
metric function of the U’s. Assumption (8) requires that there 
exist a cardinal W and cardinal U’s such that W is a summation of 
the U’s. On the anthropological level this involves (aside from its 
arbitrariness) definite restrictions on the social rates of indifference 
between the commodities and services of the same and different 
individuals. These are similar to the empirical restrictions of 
independent individual utility as discussed in chapter vii. 
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In the previous section it was noted that the “technical” re- 
straints must be assumed with the same arbitrariness as the welfare 
function itself. However, since the formulation of general equi- 
librium by Walras, it has been customary to take as given by the 
engineer the fundamental relationships between inputs and outputs 
and that production itself takes place in firms or industries which 
are distinct from the individuals, having no value in and of them- 
selves. Under modern industrial conditions this is not unrealistic. 
But even here many interesting alternatives can arise. What one 
calls economics, economic engineering, engineering, etc., is to a 
considerable degree a matter of choice. One can assume that all 
production decisions involving relative marginal productivities are 
the concern of the engineer, or economic engineer, and that the 
economist can take as already derived a transformation relationship' 
between the X’s and F’s of the form 

T(X lt Fi, F a , ■ • • , F«) = 0. (17) 

This implicit relation is interpreted to give the maximum amount 
of any one output as of given amounts of all inputs and all other 
outputs, or the minimum amount of any one input as of given 
amounts of all outputs and all other inputs. 

But if one starts from more primitive technological assumptions, 
such as the production functions of each commodity, the trans- 
formation locus is a derived theorem and not an axiom. Behind 
it lie many interesting optimal production conditions involving 
marginal productivities and other magnitudes usually thought of 
as economic rather than engineering. 

Production Conditions 

In a synthesis of welfare economics we can derive first a set of 
production conditions which require for their validity the weakest 
of all ethical assumptions: simply that more of any one output, other 
commodities or services being constant, is desirable; similarly, less 
input for the same outputs is desirable. 

Under the simplest technological conditions we can take as 
given the production function relating each output to the inputs 
devoted to it. 

Xi — X*ipu, Vi i, • • *, Vmi), (i — 1, ■ ■ ■, n) 


( 18 ) 
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where the first subscript indicates the kind of productive factor 
and the second subscript the commodity to which it is applied. Of 
course, the total applied to all commodities is given by 

Vj = Vji + Vj 2 + * * • + Vjn. (j = 1 , * * *, m) (19) 

Ordinarily it is assumed that the production function possesses 
partial derivatives (marginal productivities), but the above form- 
ulation includes the case of so-called fixed proportions or fixed 
coefficients where the production function has contour lines with 
corners. 

Equations (18) and (19) represent (n + m) relations. If we 
specify arbitrarily all but one output (sendee), we can maximize 
(minimize) the remaining one. This is an extremum problem in 
which there are auxiliary constraints. The first order partial de- 
rivative conditions can be expressed directly as proportionalities 
and rank properties of the matrix of first partial derivatives of our 
functions. But it is illuminating to express these directly by 
means of the artifice of Lagrangean undetermined multipliers. To 
do this we set up the function 

$ = T 1 X 1 T it 2 X 2 T * * * 

+ T n X n + XxFa + X 2 F 2 + • • • + X m V m (20) 

and pretend to maximize it treating all the variables as if they were 
independent, and treating the ws and X’s as (undetermined) con- 
stants. If the correct secondary extremum conditions are written 
out in full, it will be seen that they are quite different from those 
that would have to hold if the $ function were really to be maxim- 
ized. Only by accident would these secondary conditions coincide; 
only by accident would $ not be at a minimum or would it be at 
an extremum rather than simply at a stationary value. 

This may seem to be an esoteric and recondite point of little 
practical significance. It is the primary contention of the present 
work that everything interesting is contained in the Inequalities as- 
sociated with an extremum position, rather than in the equalities. 
This Is no less true of the field of welfare economics. From a 
deeper point of view market prices , regarded as parameters by perfect 
competitors , are nothing more nor less than Lagrangean multipliers . 
The Langrangean expression $ corresponds to Barone's $ function 
or to Hotelling's price potential and can be regarded as the value 
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of output or national product, expressed in terms of money or any 
numeraire, and generalized by the subtraction of factor costs. 

If the game of competition leads to optimal conditions, it does 
so partly by accident. For it is precisely under the conditions 
favorable to the maintenance of atomistic competition (briefly 
absence of decreasing costs) that the secondary conditions of the 
correct maximum problem agree with those guaranteeing the max- 
imization of the money value of output as of fixed prices. Where 
there are substantial technological increasing returns, competition 
as an empirical phenomenon breaks down. It is in such conditions 
that collectivism is likely to be considered seriously as a social 
policy. If then a socialist regime should insist upon mechanically 
playing r the game of competition with prices regarded as parameters 
a la Lange , 28 its managers would rush away from the minimization 
of <f>, even though this minimization may be precisely what is 
required under decreasing cost if welfare is to be maximized. 

The moral is not that prices should cease to be related to mar- 
ginal cost, or that planning under socialism is impossible. It is 
simply that the decentralized operators in a planned society should 
refrain from a literal aping of atomistic, passive, parametric price 
behavior. Instead of pretending that demand curves are infinitely 
elastic when they are not, the correct shape of the curve is to be 
taken into account. This does not mean that the decentralized 
operators should take account of their influence on price as a 
monopolist would . 29 

After the elimination of Lagrangean multipliers, the first order 
maximum conditions take the form 

ax, 

dXk 

a^xi, 

28 O. Lange, article in On the Economic Theory of Socialism , B. E. Lippincott, ed. 
(Minneapolis: University of Minnesota Press, 1938), pp. 55-142. 

28 When we come to the full statement of welfare conditions, it will be seen that 
unusual difficulties do arise in the decreasing cost case in determining whether a given 
maximum position represents a maximum maximorum or whether the number of differ- 
entiated products should be reduced. 


dXi 

= = Ilf = ^ = •••,«) (21) 
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or the equivalent form 


(j, r = 1, • • - , m) (22) 


In words this takes the form : productive factors are correctly 
allocated if the marginal productivity of a given factor in one line is 
to the marginal productivity of the same factor in a second line as the 
marginal productivity of any other factor in the first line is to its mar- 
ginal productivity in the second line. The value of the common factor 
of proportionality can be shown to be equal to the marginal cost of the 
first good in terms of the ( displaced amount of the) second gaod. so 

Geometrically these conditions can be easily derived in the two 
good, two factor case by means of a Jevons-Edgeworth-Bowley- 
Lerner diagram consisting of a box whose respective sides are 
equal to the available total amounts of the two factors. Any 
point within the box, if oriented with respect to the lower left-hand 
comer, can be thought of as representing the amounts of the two 
factors used in the production of the first good, and the iso-product 
contour lines can be sketched in. The same point when referred 
to the upper right-hand comer represents the allocation to the 
second good, whose contour lines can also be superimposed on the 
same diagram. 

If we specify the amount of one of the goods and restrict move- 
ments to one of these contour lines, the optimum position is reached 
only when we have touched the highest output line of the other 
good, or at a point of tangency of the opposing iso-product lines. 
The geometrical diagram indicates the correct secondary' condi- 
tions which are quite different from conventional diminishing re- 
turns. The locus of ail such points of tangency, which can appro- 
priately be called a “generalized contract” curve, represents the 
infinity of optimum positions. If along this locus we read off the 
amounts of the respective products, and plot one of these magni- 

30 By introducing as variables inputs and outputs of different dates optimal behavior 
over time can be included in the above formulation. However, when this is done, it 
will be seen that contrary to the belief of most economists since the time of B3hm- 
Bawerk, no single real interest rate is implied for a capitalist or socialist state. 
Equality would be a necessity only in the highly unusual case where relative prices of all 
goods remain the same over time. 


dXj dXn 

b Oj 1 dzijn 

axT" = dX^ = 
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tudes against the other, the resulting locus is the substitution, 
transformation, or opportunity cost curve, T. The slope of this 
curve at any point represents the marginal cost of one of the goods 
in terms of the other, or the ratio of marginal costs expressed in 
terms of some third magnitude . 31 

The substitution curve is drawn up as of given amounts of the 
factors of production and will shift with any changes in the latter. 
For a given transformation curve to be of any relevance, the factors 
of production must be regarded as being indifferent as between 
different uses . 32 

If constant returns to scale and only one factor (perhaps equal 
to a composite dose) are assumed, we have the classical case of a 
straight Jine transformation curve. If the first of these assump- 
tions is made, but there is more than one factor of production, 
unless the goods should just happen to use the factors in the same 
proportion, the transformation curve will be concave to the origin 
because of the law of diminishing returns to varying proportions. 
The curve can also assume this shape for other reasons relating to 
returns to scale. In the decreasing cost case, however this arises, 
the curve will be convex to the origin. 

In all of these cases let the firms or planners really take prices 
as given and attempt to maximize value of output or <f>. In the 
constant returns case there will be complete specialization on one 
of the goods (the one with comparative advantage) or complete 
indifference at the critical price ratio equal to the cost transforma- 
tion ratio. If both goods are consumed, and if the economy is a 
closed one, the latter critical price ratio is sure to be the only 
relevant one. 

In the increasing cost case at given price ratios the firm will 

31 For the geometrical picture see W. F. Stolper and P. A. Samuelson, “Protection 
and Real Wages,” Review of Economic Studies , IX (1941), 58-74. 

32 It is only when the last two unnecessary and unrealistic assumptions are made 
that the so-called opportunity cost doctrine is even formally valid. Even under these 
conditions the usual formulation is rather a mumbo jumbo of high sounding gibberish, 
which does not state the conditions of equilibrium in a very direct way. All this is 
further complicated by the fact that most enunciations of the opportunity cost doctrine 
are purely verbal: leisure is treated as a displaced good ! Inevitably, therefore, when 
the opportunity cost doctrine is carefully stated and qualified, it degenerates into the 
full conditions of general equilibrium in which factor supply and preference equations 
must be introduced if only as inequalities. This is not to imply that one must accept 
the dubious psychological language and interpretations of classical real cost theorists. 
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come into equilibrium at the levels of output which make prices 
proportional to marginal (transformation or factor) costs. How- 
ever, in the decreasing cost case the planners in a socialist state at- 
tempting to maximize at fixed prices would concentrate on one or the 
other of the two commodities. But this would be optima! only in 
an open economy which could really trade with the outside world 
(in unlimited amounts) at the given price ratio. I n a dosed planned 
state it might be desirable to produce something of both, to have 
price ratios equal to marginal cost ratios even if that meant a mini- 
mization of <3?. If we regard the passive approach of the decen- 
tralized planners as an equilibrium process, it is clear that the cor- 
rect optimum, being a minimum and not a maximum, would not be 
an equilibrium point at all. Or if the planners acted like Balaam’s 
ass, it could be regarded as a highly unstable equilibrium point 
from which they would rush at the slightest disturbance. 

Thus it is only accidentally — the first two cases — that com- 
petition is optimal. For it is in those cases that the Lagrangean 
expression (20) is at a maximum when we are at the correct maxi- 
mum of (18). In these accidental cases, if we like, we may think 
of our production decisions as being independent of preferences, in 
the sense that they are uniformly best with respect to the most 
extreme weightings of the commodities. Under increasing returns 
we may even never settle on society’s best transformation curve. 

Actually our production conditions might have been introduced 
in more complex form than as single, isolated production functions 
of the form (18). It is to a certain extent arbitrary as to how much 
preliminary maximizing behavior we consider as being done by the 
engineer and how much by the economist. Under competitive 
conditions the relevance of prices to the decision is taken as the 
dividing line; but for welfare purposes we may dispense with prices 
completely. However, it would seem somewhat awkward then to 
term the problem one of engineering; still some economists 
might prefer to begin with the transformation function, T. 

If we examine production possibilities more closely, it will be 
apparent that in many cases it will not be possible to end up with 
a single transformation function, T. Thus suppose that commodi- 
ties divide up into two or more groups such that no productive 
factor is used in more than one group. Then there will be at least 
as many independent transformation curves as there are groups. 
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Of course, formally these can be combined into a single implicit 
equation, e.g., by equating their summed squares to zero, but this 
is trivial. No indeterminacy arises, however, from the multi- 
plicity of transformation curves; we have fewer choices to make 
and therefore fewer marginal conditions. 

Pure Exchange Conditions 

The production conditions of the previous section were deriv- 
able on the basis of rather mild assumptions by virtue of the fact 
that more of all goods, however they are to be divided, would seem 
better than less. These conditions are still only necessary. They 
are not sufficient, since decisions must still be made as to how to 
divide up given available goods, and as to which of the available 
amounts of production will actually be used. 

Even if the last of these two decisions is made, there remains 
the problem of dividing up a given total of all commodities and 
services among individuals. Once there has been specified a clearly 
defined W function, the final optimum point can be easily deter- 
mined. But at this point after assuming, if only implicitly, the 
first six serious assumptions, the modem welfare economist be- 
comes timid and hesitates to make assumption (7) or indeed any 
other specific welfare function. He has been conditioned against 
making ethical assumptions. If possible, therefore, he would like 
to develop further optimum conditions which are within broad 
limits independent of the form of the W specified. In fact, as has 
been known since the time of Pareto at least, it is possible to 
gratify this wish and to specify still another set of necessary, but 
not sufficient, conditions which must prevail as between individuals. 

Here the simple assumption is made that W is undefined, or 
unspecified, except to the extent that it is a monotonic increasing 
function of the individual U’s. We are not able to plot in the 
U 1 — V 2 plane a locus of points along which W is constant. We 
know only that a movement in the northeast direction of the plane 
increases W and is good, a movement in the southwest direction 
decreases W and is bad ; a movement into either of the other two 
quadrants taken around a given point is indeterminate. 

In words, the simple assumption is made that only if all indi- 
viduals are made better (worse) off can we definitely state that a 
given movement is good (bad). Otherwise we must reserve judg- 
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ment. Mathematically it is clear that for the levels of utility of 
all but one individual being arbitrarily specified, it is a necessary' 
condition of equilibrium that the utility of the remaining individual 
be at a maximum, subject to the condition that there be fixed totals 
of all goods. For 
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By using Lagrangean multipliers or by direct methods, it can 
be easily shown that in equilibrium the ratio between marginal 
utilities of two goods consumed by one individual must be the 
same as the ratio of marginal utilities of the same goods to any 
other individual who consumes the same goods. If one or more 
of the goods is absent from the consumption of an individual, cer- 
tain inequalities can be introduced to generalize the above condi- 
tions. We may write the first order conditions of equilibrium in 
the form 
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It will be noted that only ratios of marginal utilities for the same 
individuals are ever involved . 33 Thus, there is no need for nu- 
merical utility even for a single individual and no need to compare 
the utilities of different individuals. 

Graphically the equilibrium may be represented by the same 
box-type diagram as was mentioned in the last section. The di- 
mensions of the box represent the fixed totals of the commodities 
in existence. Any interior point oriented with respect to the 
lower left-hand comer represents the amount of consumption of 
the first individual, and the contour lines are now to be interpreted 
as indifference curves. Referred to the upper right-hand comer, 
a point represents the second individual’s consumption, and his 
contour lines may be superimposed on the other. Specifying the 

33 Equilibrium conditions regarding the factors of production need not be indicated 
separately if it is remembered that they may be treated as negative commodities. 
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utility of the second individual, we maximize that of the first by 
moving along the given indifference line of the second individual 
until we reach (at a point of tangency) the highest indifference 
curve of the first. Because the original specification of the second 
man’s utility was arbitrary, the final equilibrium is also arbitrary 
and not unique. The locus of all such arbitrary points is, of 
course, the familiar Edgeworthian contract curve and represents 
the set of points satisfying necessary exchange conditions. 

After the discussion of the previous section it will not be neces- 
sary to reiterate at length the contention that the equality of the 
various ratios can be expressed without Lagrangean multipliers 
and without price ratios; moreover, in by no means impossible 
cases, even playing the game of competition would lead away from 
rather than toward the correct optimal position. As before, this 
is because the secondary conditions for a constrained maximum 
are not the same as the secondary conditions necessary literally to 
bring the Lagrangean expression to an extremum. 

It will be noted that from any point off the contract curve there 
exists a movement toward it which would be beneficial to both 
individuals. This is not the same thing as to say, with Edgeworth, 
that exchange will in fact necessarily cease somewhere on the con- 
tract curve; for in many types of bilateral monopoly a final equi- 
librium may be reached off the contract curve. Nor is it the same 
as to say that points on the contract curve are better than points 
off the contract curve. Later I shall discuss the correct formula- 
tion of the significance of this condition. 

But first it is well to write down the full first order conditions 
of equilibrium which must hold if we are to have an optimum of 
both production and exchange, that is, if we combine with the 
conditions of this section the conditions of the last section. Then 
(i) we must have a common marginal rate of indifference between any 
two goods for every individual; this common indifference ratio must, 
moreover, be equal to the ratio at which one of these goods can be 
transformed into the other in a production sense, the transformation 
to come about as the result of transferring any resource from one 
good's production to the other's. (2) We must have for all individuals 
a common ratio of indifference between supplying more of any factor 
of production and enjoying more consumption of a given good; this 
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common ratio must be equal to the rate at which supplying more of 
that factor results in greater production of the good in question. 
Mathematically, 
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These conditions may be rewritten in a variety of ways but only 
being (at the first order level) necessary conditions rather than 
sufficient conditions, they are necessarily less in number than the 
number of the unknowns of the system. This is after taking ac- 
count of the fact that the total of factors used in every use and the 
total of commodities consumed must equal the sums of these mag- 
nitudes over all individuals, i.e., even after taking into account 
equations (14). We are still short (5 - 1) equations. These can 
only be supplied on the basis of definite assumptions concerning 
how different individuals enter into the W function. This will be 
discussed in detail in the next section. 

Returning to the combined production and exchange conditions 
of equations (26), I should like to point out various alternative 
formulations of them. First, it is easy to derive from them the 
conditions that the ratios of indifference between any two factors 
must be uniform for all individuals, and equal to the relative 
marginal productivities of the two factors in an}*- line of production. 

Second, it is sometimes convenient to interpret the first of these 
equilibrium conditions as specifying the equality of price and mar- 
ginal utility ratios to the ratios of marginal cost. There is occa- 
sionally some uncertainty in the literature as to whether prices 
should equal marginal costs or just bear the same percentage dis- 
crepancy in every line. The distinction is especially important if 
decreasing costs are very prevalent, for then it is impossible for 
capitalistic monopolies or socialistic bureaus to recover their full 
costs of production if they charge only prices equal to marginal 
costs. Recourse must be had to direct taxation to cover the differ- 
ence or to the imputed revenues accruing to state-owned factors of 
production, revenue which in a world of constant returns to scale 
would be available for distribution to the citizens of the state. 
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So long as only price ratios are used it is not necessary to state 
the units in which costs are measured, dollars, labor, fertilizer, or 
opportunity-substitution costs. In the usual exposition partial 
equilibrium conditions are implicitly assumed, and costs, wages, 
and prices are expressed in terms of dollars. In these conditions 
must prices be equal to marginal costs, or just proportional? The 
answer is equality if we take into account the conditions relating to 
factors of production such as are embodied in the second part of 
equations (26). If all factors of production were indifferent be- 
tween different uses and completely fixed in amount (the pure 
Austrian case), then we could dispense with these conditions, and 
proportionality of prices and marginal cost would be sufficient. 
But if we drop these highly special assumptions, for which there is 
not in any case empirical or theoretical warrant, then if all prices 
were proportional to (say double) marginal costs, we should not 
have an optimum situation. By working a little more or less 
everybody could be made better off since in the described situation 
the preferred terms at which they exchange leisure and goods are 
not equal to the true productivity terms at which they can be 
transformed into each other. 

I need not remind the reader that first order conditions are 
really of secondary importance as compared to the full inequalities 
implied by an optimum position. Also it should not be necessary 
to show in detail that the above conditions hold only if every factor 
of production is actually used in every line of production, and if 
something is produced of every good, and if all production and 
indifference functions have continuous partial derivatives. If any 
of these conditions fail, the number of equalities may be diminished, 
but in every case the correct general inequalities will prevent there 
being any essential ambiguities in the characterization of the 
optimum position. A closely related problem arises when there 
are multiple positions of equilibrium. With strongly convex in- 
difference fields and strongly concave cost functions such cannot 
occur. But there is no reason why human nature should display 
the simple, regular properties which the observer-economist finds 
convenient. And it is the essence of the decreasing cost, increasing 
returns, lumpiness, indivisibility phenomena that improper curva- 
ture on the cost side should enter the picture. 

Especially where differentiation of product arises as a result of 
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monopolistic competition or as its cause, there arises the problem 
as to whether or not a commodity should be produced at all. If it 
is produced, the marginal cost conditions of (26) should be realized, 
but there may be a better maximum where none is produced. 
Here the extremum position is of the comer type, and the conven- 
tional equalities must be replaced by inequalities. This involves 
decision making at a distance; we cannot feel our way to the op- 
timum, step by step, but must boldly experiment with diverse 
combinations. Where such “all or none” phenomena are con- 
cerned, things often get worse before they get better, and so deci- 
sions im kleinen will not suffice. Even aside from the difficulties of 
the paragraph after next, we cannot decide that a thing should be 
produced if a perfectly discriminating monopolist or government 
bureau could recover its total cost by some sort of a take-it-or- 
leave-it offer. In these cases involving finite decisions we must 
ask consumers (or Robinson Crusoe) whether a given abundance of 
fewer commodities is preferred to an alternative scarcity of a 
greater range of commodities. 34 At this point there will inevitably 
arise questions concerning the rationality of individual choice, 
questions which had been summarily suppressed by a cavalier 
acceptance of the validity of assumptions five and six set out in 
the third section back. But these questions, whatever their im- 
portance, cannot be settled by deductive analysis. 

The equality of price to marginal cost creates confusions in the 
minds of many concerning the question as to whether or not costs 
which are fixed in the short ran are not variable in the long run and 
hence should be covered by price. They ask whether price should 
not exceed short run marginal cost since the latter excludes elements 
of long range variable costs. There are two confusions revealed by 
this question. First, marginal cost is sometimes treated as a part 
of total unit costs, which it is not. Marginal cost is the difference 
between costs in two situations and cannot be identified in general 
with given components of cost, labor, materials, etc. Sympto- 
matic of this confusion is the statement that competition tends to 
drive price down to prime or variable costs so that full costs are not 
recovered. Of course, pure competition (which is not really im- 

" In certain special cases consumer’s surplus may be employed to describe finite 
inequalities. But these cases are rare, and in any case we are rather better off if we use 
direct methods. 
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plied in the previous sentence) involves price equal to marginal 
cost, which may far exceed full unit costs depending upon the scale 
of output and the level of price. Not only did Alfred Marshall 
speak loosely on this matter, but so great a contribution as J. M. 
Clark’s Economics of Overhead Costs errs, at least in exposition, in 
its diagnosis of the role of overhead costs in the breakdown of 
competitive theory. An examination of the character of agricul- 
ture, the last stronghold of pure competition, would show a high 
level of overhead costs with no tendency toward the breakdown of 
competition. Aside from non-imitable differentiation of product 
the principal cause of disintegration of competition is decreasing 
long run unit costs of each firm at levels representing large fractions 
of total demand for the product; this would be true even if all goods 
could be produced to order. 

Marginal cost is not part of cost which must be met, and the 
equality of price and marginal cost has nothing to do with a re- 
covery of full costs, determination of fair return on investments, 
correct imputation of factor shares, etc. Its purpose is to secure 
correct factor allocation, and avoid anomalous product allocation. 
This is accented if one considers a wavy cost curve or a cost curve 
showing a comer at the point of capacity (long run or short run). 
At the latter point marginal cost is undefined, or, if you prefer, it 
is anywhere from a finite amount to infinity. There is no clue as 
to proper price determination, but this is of no consequence so long 
as production does actually take place at the capacity point. 
Similarly, as demand fluctuates for a commodity whose cost curve 
shows waves because of lumpiness, etc., the correct price charged 
will vary greatly; rather than let a train set out with a single empty 
seat, the price should fall to zero. It is no argument against this 
statement that such pricing may not recover full costs, or that 
passengers will tend to wait until trains are least crowded. What 
could be more desirable socially than the evening out of traffic? 
Does not a railroad company which gives out passes to its own 
employees often follow the sound principle that they may ride as 
they please so long as they add nothing to costs? Incidentally, it 
will be seen from this discussion that where small changes in de- 
mand create great changes in social cost, posted, administered, 
relatively stable prices are not optimal devices. 

The second and less important confusion involved in the state- 
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ment that price should exceed short run marginal cost stems from 
the fallacious belief that long run marginal cost "inclusive of 
changes in long run variable factors” is greater than short run 
marginal cost. Actually, we know from the Wong-Viner-Harrod 
envelope theorem that they are equal for instantaneous rates of 
change if production is taking place at the anticipated level. For 
an actual, finite, forward movement from this point, short run 
marginal cost “exclusive of the fixed factors” is of course greater 
not smaller than marginal cost when all factors are varied opti- 
mally. Whether or not output is at the anticipated level, ‘price 
equal to short run marginal cost is necessary in order that the 
existing plant be used optimally; the relationship between price 
and long run marginal cost is relevant to a decision as to whether 
to change the size of plant, when that decision arises. If one 
dropped the first order statement of optimum conditions and re- 
placed them by the general inequalities which must be satisfied, 
it would be obvious that the optimum position must satisfy a 
different maximum condition (inequality) for every alternative 
which can be contemplated. Thus, it must not pay to make a 
small step forward or backward, a middling step, a sizable step, 
a huge step, to shut down completely or open a new line, etc., etc. 
Each of these implies a relationship between price or revenue and 
entirely different measures of marginal or differential cost . 35 

Interpersonal Optimal Conditions 

In the last two sections an attempt was made to derive as 
general conditions as possible with a minimum of controversial 
assumptions. Nevertheless, we have seen that it is not possible to 
deduce a unique equilibrium unless we have more to build upon. 
This is only as it should be, for intuition assures us that there can- 
not be an optimum position which is independent of the exact form 
of the W function. Even if all of the necessary conditions of 
production and exchange are satisfied, we are short as many equa- 
tions as there are individuals in excess of unity. In a Robinson 
Crusoe world where there is but one individual (pace Friday !) there 

35 Once this fact is mastered, the whole question of * ‘allocation of perfectly joint 
costs” is seen to be a false and irrelevant problem for either a firm or society. It derives 
its present day empirical importance from the intrusion, not necessarily irrationally, of 
“full cost” considerations into price formation, and to problems of government regula- 
tion, e.g., T.V.A., Tariff Commission, etc. 
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are no individuals in excess of unity, and the equilibrium is unique. 
But as soon as we have more than one individual, our previous 
conditions simply assure us that we are on the “generalized contract 
locus” from which there are no possible movements which are 
advantageous to each and every individual. 

An infinity of such positions exists ranging from a situation in 
which all of the advantage is enjoyed by one individual, through 
some sort of compromise position, to one in which another indi- 
vidual has all the advantage. Without a well-defined W function, 
i.e., without assumptions concerning interpersonal comparisons of 
utility, it is impossible to decide which of these points is best. In 
terms of a given set of ethical notions which define a Welfare 
function the best point on the generalized contract locus can be 
determined, and only then. 

All of this may be formulated mathematically as follows. The 
optimal conditions of production and exchange as given in equa- 
tions (26) which summarize the argument of the previous two sec- 
tions permit us to reduce the level of indeterminacy of the system 
to an implicit equation between the levels of well-being of the 
different individuals in the system. This may be written as 

P(U\ U\ •••, Z7 8 ) = 0. (27) 

This says that we may specify at will the levels of well-being of 
all but one of the individuals, and the last one’s well-being is 
uniquely determined. The essential shape of this Possibility func- 
tion depends of course upon the technological and other assumed 
restraints of the system as well as upon the tastes of the different 
individuals. Notationally, the same implicit function can be 
written in many different ways, but the locus in question is invari- 
ant under these purely terminological changes. One should note, 
however, that since the exact numerical form of each U is arbitrary, 
no importance attaches to the curvature properties of this locus. 

If we have a given definite welfare function, then we are to 
maximize 

W = W(IP, ■ ■ - , U‘) (28) 

subject to the above constraint. The first order condition of 
equilibrium takes the form 

Wi _Pi 

Wj - Pi • 


(i,j = 1, • • -, s) 
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These conditions, like the correct secondary conditions which need 
not be written explicitly, are independent of purely notational 
ambiguities involved in the welfare function, the possibility func- 
tion, and the selection of particular cardinal indices of individual 
utility. 

Equations (29) give us the missing (5 — 1) conditions of equi- 
librium, and our equilibrium is seen to be finally determinate. It 
is not at all necessary to take the expositional path followed above. 
Thus, Professor Pigou, who does not hesitate from the very be- 
ginning to make inter-individual comparisons of utility, moves 
directly via a maximization of (1 1) subject to (12) to the same equi- 
librium which Pareto-Barone-Lemer reach after they have ad- 
mitted into their systems interpersonal comparisons of, utility. 
And as will be discussed at greater length below, they end up with 
no optimum and no final definite conclusions if they are unwilling 
to permit others to introduce such ethical considerations. 

The real significance of their analysis lies in the fact that they 
provide a relatively easy opening for the introduction of ethical 
notions. For it follows from their analysis that all of the action 
necessary to achieve a given ethical desideratum may take the form 
of lump-sum taxes or bounties. These may be given in the form of 
abstract purchasing power or in kind, it being necessary, however, 
that the latter should be freely exchangeable against other goods. 
Thus, instead of having to decide how much to allocate of each and 
every good to each individual, the ethical authority need only 
decide on the allocation of final incomes between individuals. 

Working directly from a well-defined (ordinal) welfare function, 
we require that W be maximized subject to the physical constraints 
of the system. We may express the final conditions in many 
different forms, with or without Lagrangean multipliers. One such 
compact method which avoids as far as possible duplicating the 
exact form (but not the substance) of the conditions of the previous 
section is given as follows: 




(30) 
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Not only are these equations very similar to the famous Note 
XIV which summarizes much of what is best in Marshall’s Prin- 
ciples, but they are cast almost exactly in the form followed by 
Professor Pigou in his Economics of Welfare. In words, two condi- 
tions are implied : first, the marginal social utility ( disutility ) of the 
same good {service) must be equal for each individual; second, each 
factor of production must be divided among different possible uses so 
that its indirect, derived marginal social utility must be the same in 
every use and equal to its marginal social disutility. 

We may leave to the reader the task of showing the necessary 
modifications if the supply of a factor is inelastic, if a factor of 
production is not indifferent between different uses, if there is joint 
production, if there are external technological economies or dis- 
economies (so that the production function of a good contains in 
it factors of production devoted to other uses), etc. By dividing 
the above equations through by any single partial derivative, they 
may be thrown into a form which is independent of the particular 
cardinal representation of W. 

While the Lemer production conditions are contained in the 
above set of equations, the exchange conditions are not. However, 
if we make the individualistic assumptions five and six above so 
that the welfare function takes on the special form given in equa- 
tion (13), then in virtue of the identities 

dW _ dWdUf 

dx{ 8U r dXi r 

the exchange conditions of the previous section are also included 
in these equations. Thus, the production and exchange conditions 
which constitute the “new welfare economics” are included in the 
old, but are themselves incomplete. 

If we substitute the last identities in the fundamental equations 
(30), they can easily be thrown into the production and exchange 
conditions given in (26) plus the new interpersonal element guaran- 
teeing us that the distribution of incomes and lump-sum allowances 
is optimal. This takes the mathematical form of equality of 
marginal social utility of income (expressed in terms of any good) 
for all individuals. 

dW = dW ' _ dW 

dXi 1 dXi 2 dxi" 


( 32 ) 
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The statement that any “individualistic” ethical optimum can 
be effected by lump-sum taxes is a theorem, not an axiom. For its 
proof we require only that the individual indifference curves be such 
that they can be swung into final equilibrium by operating upon 
their incomes, letting them trade mechanically at fixed prices. 
The vast importance of this theorem for social policy in a planned 
or free enterprise economy is deserving of the greatest emphasis. 
Yet its greatest significance is in the realm of administration and 
tactics. From the standpoint of the logic of welfare analysis it is 
not fundamental. In fact it is not universally true. 

First, the indifference curves of one or more individuals may 
have curvature properties such that the individual who believes 
or pretends to believe that prices are parameters beyond his control 
will rush away from rather than toward the correct optimum posi- 
tion. The remarks in the section above dealing with production 
which discuss the analogy between prices and Lagrangean multi- 
pliers and secondary conditions may be referred to here. 

Second, and of more practical importance, the optimum position 
may be reached by means of discriminating all or none offers, 
sliding scales, etc., rather than by trading at uniform prices. The 
important thing is the realization of appropriate marginal condi- 
tions rather than what happens to the intra-marginal units. (Com- 
pare on this point the discussion above of recovery of full cost as 
a goal of pricing.) In a regime such as this, lump-sum allowances 
may not suffice to realize the optimum point. 

Third, it is not really easy to devise in practice a tax or subsidy 
which is of a purely lump-sum character. A tax upon income 
affects marginal decisions with respect to effort and risk-taking. 
This is obvious. Less conspicuous is the fact that the usual ethical 
notions compel us to set our allowances according to a man’s 
“circumstances,” which are partially the result of his own actions 
and decisions. Analytically, the problem resembles that of deter- 
mining fair piece rates or of setting a fair “handicap” for golfers 
of different ability. We wish to equalize opportunity for all con- 
testants, but we do not want them to hold back from playing their 
best because of a fear of losing their favorable handicap. Ideally, 
the social managers would have to know the potentialities of every 
individual; and to remedy the distortions of imperfect lump-sum 
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taxes, they would have to set a system of quotas and penalties 
based upon potentialities rather than upon performance. 

Thus, we might decide that everyone should have at least a 
minimum income, that Society will make up the deficiency between 
what the less fortunate can earn and this minimum. Once this is 
realized by those who fall below the minimum, there is no longer 
an incentive for them to work at the margin, at least in pecuniary 
material terms. This is clearly bad social policy, not because I 
have a vulgar prejudice in favor of work and against leisure. On 
the contrary, the increases in real income in the years ahead prob- 
ably will be spent in considerable degree on leisure. It is wrong 
because it forces the rest of society to give up leisure. Its fault lies 
in the fact that the individual’s allowance is not a lump-sum one. 
It varies* inversely with his effort, and thus penalizes effort. It 
would be tolerable only if a small percentage of the population 
were below the minimum, or if we could rely upon new motivations 
in the Utopia of the future. I, myself, put considerable stock in 
the possibility of changing conventional patterns of motivation. 
However, this will not provide comfort to those who wish to utilize 
a parametric pricing system with algebraic lump-sum allowances, 
since these same considerations undermine the “individualistic” 
assumptions upon which their analysis is based . 36 

Fourth, because decisions need only be made concerning in- 
comes, the problem of formulating political slogans and beliefs 
which will command wide approbation is simplified.. However de- 
sirable this may be from a political point of view, it must never be 
forgotten that from a consistent ethical point of view decisions 
should be made concerning the welfare function itself. Beliefs con- 
cerning the distribution of income are derivative rather than funda- 
mental. Except in the admittedly unrealistic case where all tastes 
are identical, setting up such beliefs as goals is equivalent to ac- 
cepting a “shibboleth” and to embracing an ambiguous, undefin- 

38 The clash between equity and incentive is not unimportant for the present stage 
of modern capitalism. To an important first approximation the adverse effects of high 
marginal rates of taxation stem not from the level of the rates, provided they are less 
than 100 per cent, but from the “curvature” of the tax formula whereby gains increase 
tax liability more than equivalent losses reduce tax liability. But the essence of “pro- 
gression” in taxation and in income redistribution is curvature. The only solution 
lies in pooling risks, and in drafting tax legislation so as to produce a maximum of 
averaging over time. If this does not suffice, equity may still be worth considerable 
unavoidable costs. 
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able welfare function. Among other things such a procedure in- 
volves making a shibboleth of the existing distribution of relative 
technological scarcities of goods. (Of course, whether this is a bad 
practice is not in itself part of the content of welfare economics, 
which does not presume to deduce appropriate beliefs. However, 
welfare economics can legitimately point out the implications of 
different ethical propositions.) 

New Versus Old Welfare Economics 

While in a real sense there is only one all-inclusive welfare eco- 
nomics, which reaches its most complete formulation in the writings 
of Bergson, it is possible to distinguish between the New Welfare 
Economics, which involves roughly the contents of the sections on 
production and exchange and which makes no assumptions con- 
cerning interpersonal comparability of utility, and the Old Welfare 
Economics which starts out with such assumptions. Roughly it 
is the distinction between Pareto and Pigou. From our above dis- 
cussion it will be clear that the former is included in the latter, but 
not vice versa. 

Strictly speaking there is no opposition between the two points 
of view. However, it is not uncommon for expositors of the “new” 
set of doctrines to imagine that their results have significance even 
if one is unwilling to make any ethical assumptions. In fact this 
belief is almost a necessity for anyone who has taken seriously 
Robbins’ dicta concerning the inadmissibility of welfare economics 
in the body of economic theory. Once it is apprehended that the 
latter notion is a delusion, the need to believe in the significance of 
the production and exchange conditions divorced from the inter- 
personal conditions will disappear. Nevertheless, it may be de- 
sirable to appraise their significance taken by themselves. 

I pass over as being obvious from our discussion of the first six 
assumptions about the W function that it is not literally true that 
the new welfare economics is devoid of any ethical assumptions. 
Admittedly, however, its assumptions are more general and less 
controversial, and it is for this reason that it gives incomplete 
necessary conditions, whose full significance emerges only after one 
has made interpersonal assumptions. To refuse to take the last 
step renders the first two steps nugatory; like pouring out a glass 
of water and then refusing to drink. To say that the first two 
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conditions should be realized, but that the third is meaningless, is 
like saying that it does not matter whether or not a man has hair, 
so long as it is curly ! 

A limited significance remains for the new welfare economics if 
we hold that a welfare function is definable but undefined, and if 
we look for conditions which hold true uniformly for all possible 
definitions. It cannot tell us which of any two situations is better, 
but it can occasionally rule out one given situation as being worse 
than another in the sense that everyone is worse off. It cannot tell 
us when society really has a choice between two given situations. 
Most important of all it cannot tell us that a movement about 
which it can give a determinate answer is better than a movement 
about \yhich it cannot give an answer. If we are at a point not on 
the Possibility Function, it assures us that there exists a still better 
point. But it cannot assert that a given point on the Possibility 
Function is better than all, or many, points not on the Possi- 
bility locus. 

Concretely, the new welfare economics is supposed to be able 
to throw light on such questions as to whether the Com Laws 
should have been repealed. This act would help many individuals, 
but hurt landowners. It is usually assumed, although not neces- 
sarily quite rigorously, that free trade can be expected to lead to a 
new “situation” in which those who are benefited can afford to 
bribe those who are hurt. It is thought that this constitutes a 
“case” for free trade. Actually, this leads to no positive guide to 
action. For the new welfare economics cannot state definitely: 
“The Com Laws should be repealed, and the landowners should be 
compensated.” Occasionally such a statement is implicit in a 
writer’s formulation, but it is clear that this involves an unwar- 
ranted assumption as to the adequacy of the status quo, an assump- 
tion which the new welfare economics holds to be neither right nor 
wrong, but meaningless. 

On the other hand, the new welfare economics cannot make the 
definite statement: “The Corn Laws should be repealed, and the 
owners should not be compensated.” It can only assert in a nega- 
tive way: “ ’Twere better that the Com Laws be repealed, and 
compensation be paid, if necessary .” It gives no real guide to 
action. Nor does our experience idth man as a social animal sug- 
gest that one can safely predict, as a factual matter, that “educated 
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and intelligent men of goodwill” in point of fact tend to move to 
the generalized contract locus. As an empirical statement of fact 
we cannot agree with the assertion of Edgeworth that bilateral 
monopolists must end up somewhere on the contract curve. They 
may end up elsewhere, because one or both is unwilling to discuss 
the possibility of making a mutually favorable movement for fear 
that the discussion may imperil the existing tolerable status quo. 

Scitovsky 37 has attempted to remedy certain deficiencies in the 
usual statement of the new welfare economists by developing a 
double test of the desirability of two situations, in which the dis- 
tribution of income of the new situation is treated symmetrically 
with that of the old situation. While his is an improvement on 
certain earlier statements, it is so primarily in a negative way, to 
the extent that it delimits the scope of the new welfare economics. 
His own positive formulation is also a “ ’twere better” statement, 
and not a positive guide to action. Such statements are not with- 
out value, but they are not at all substitutes for the policy dictates 
which stemmed from the old welfare economics. 

Withing the limited scope of the “ ’twere better” statements 
there is an ambiguity which Scitovsky alone seems to have sensed. 
The term “situation” can mean a number of different things. It 
can refer to an actual position reached by every individual prior to 
the repeal of a tariff, and to the new situation which would actually 
be attained by each individual after new action is taken, i.e., repeal 
with varying or no amounts of compensation. Or it can mean the 
totality of all possible positions available with the Com Laws un- 
repealed, and the totality of all possible positions with the Corn 
Laws repealed. The latter is the more significant sense, and can 
be given significance in terms of the Possibility Function developed 
above. Thus, a perfectly legitimate “ 'twere better” statement 
can be made as follows: “Technological change can make everybody 
better off in the sense that it will shift the Possibility Function out- 
wards.” 38 We cannot deduce from this the dictum: “Techno- 
logical change is a good thing,” since the introduction of techno- 
logical change will in fact mean a vector movement from the old 

37 T. Scitovsky, “A Note on Welfare Propositions in Economics,” and “A Recon- 
sideration of the Theory of Tariffs,” Review of Economic Studies, IX (1941-42), 77-88 
and 89-110. 

38 In effect Scitovsky recognizes the case where the Possibility Function is given a 
twist rather than everywhere being shifted outward. 


FOUNDATIONS OF ECONOMIC ANALYSIS 


252 

positions of every individual to their new positions which can 
hardly bode good to all. 

At this point the new welfare economist will fall back on the 
vague statement that “probably technological change will result 
in good on balance.” Not only is this argument on its probability 
side based upon ignorance rather than knowledge, but it is meaning- 
less as well unless the admissibility of a W function is admitted. 

In closing I should like to point out that the point of view which 
I have been calling the new welfare economics is really only a 
caricature. On the whole, Pareto, Barone, Hotelling, and Lerner 
sidestepped interpersonal judgments rather than denied them. 
Bergson synthesizes the various aspects, as does Lange. Whether 
Kaldor .and Hicks are open to the criticism along the lines which 
Stigler has recently advanced , 39 the reader must decide. 

Conclusion 

In the present chapter I have attempted to give a brief but 
fairly complete survey of the whole field of welfare economics. It 
would be possible to develop the subject further in many respects 
and to consider a number of related problems. However, limita- 
tions of space preclude this, and I shall content myself with two 
final remarks. 

First, what is the best procedure if for some reason a number 
of the optimum conditions are not realized? What shall we do 
about the remaining ones which are in our power? Shall we argue 
that “two wrongs do not make a right” and attempt to satisfy 
those we can? Or is it possible that failure of a number of the 
conditions necessitates modifying the rest? Clearly the latter 
alternative is the correct one. A given divergence in a subset of 
the optimum conditions necessitates alterations in the remaining 
ones. Thus, in a world where almost all industries are producing 
at marginal social cost less than price (either because of monopoly 
or external economies) it would not be desirable for the rest to 

39 N. Kaldor, “Welfare Propositions in Economics,” Economic Journal , XLIX 
(1939), 549-552; J. R. Hicks, “Foundations of Welfare Economics,” Economic Journal , 
XLIX (1939), 696-712; G. J. Stigler, “The New Welfare Economics,” American Eco- 
nomic Review, XXXIII (1943), 355-359; P. A. Samuelson, “Further Commentary on 
Welfare Economics,” American Economic Review , XXXIII (1943), 604-607. See also 
P. A. Samuelson, “Welfare Economics and International Trade,” American Economic 
Review , XXVIII (1938), 261-266. 
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produce up to the point where marginal cost equals price. Neither 
would it be quite correct to seek the same percentage or absolute 
divergence from the optimum conditions in each case ; although in 
this particular example, if the elasticity of the supplies of the factors 
of production were zero, the proportionality of prices to marginal 
cost would be as good as the exact equality. Still another example 
to show that failure of some conditions necessitates alteration of 
the rest is provided by the possibility of increasing welfare by de- 
liberately selling below marginal costs to groups with a high mar- 
ginal (social) utility of income. Given a faulty distribution of 
income, this can improve the situation, although it would be still 
better to have the full optimal conditions realized. 

The last point consists of the warning that the introduction of 
dynamic conditions into our analysis necessitates a considerable 
change in the statement of optimal conditions . 40 The difference is 
not one of principle, but it is nevertheless important. Judged 
purely on statical grounds, monopolies or a patent system may 
appear as unmitigated evils, and certainly inferior to atomistic 
competition and free trade. But in a dynamical world these judg- 
ments may have to be reversed ; viz. the infant industry argument 
for protection, the stimulus to large scale research which only a 
monopolist can afford, the (alleged) necessity to hold out incentives 
to inventors, etc. Indeed the measure of support which capitalism 
commands is most importantly related to precisely these factors 
of development. 

In admitting the superiority of monopoly to atomistic competi- 
tion in certain respects, we do not imply that it is the best possible 
organization of an industry. There necessarily exists a still better 
third alternative, which may or may not be less Utopian than the 
restoration and maintenance of atomistic competition. 

40 Arguments which allege that individuals, under capitalism or socialism, decide 
what goods they shall consume, but are not capable of making a correct decision with 
respect to saving seem to suggest that there is a qualitative difference introduced by 
dynamics. I refer to the argument that wealthy men are necessary in order to provide 
for saving and capital formation, and that in a socialist state the government should 
decide what the appropriate amounts of capital formation should be. If one specifies 
a W function satisfying the first six assumptions and includes in it as separate variables 
future goods and services, both of these views are wrong whether there is capitalism or 
socialism. However, especially where problems of the present and future are concerned, 
modern philosophers balk at the assumption that what people think is best for them 
really is. Naturally, when the individualistic assumption six is dropped, our conditions 
are altered. But the same would be true if we denied this assumption in a statical world. 
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CHAPTER IX 


THE STABILITY OF EQUILIBRIUM: COMPARATIVE 
STATICS AND DYNAMICS 

Introduction 

It was an achievement of the first magnitude for the older mathe- 
matical economists to have shown that the number of independent 
and consistent economic relations was in a wide variety of cases 
sufficient to determine the equilibrium values of unknown economic 
prices and quantities. Since their life spans were only of finite 
duration, it was natural that they should have stopped short at 
this stage of counting equations and unknowns. It remains to be 
explained, however, why in the first quarter of the twentieth cen- 
tury economists should have been content with what was after all 
only preliminary spade work containing in itself (at least ex- 
plicitly) few meaningful theorems of observational significance such 
as could even ideally be empirically refuted under any conceivable 
circumstances. 

It is the task of comparative statics to show the determination 
of the equilibrium values of given variables (unknowns) under pos- 
tulated conditions (functional relationships) with various data 
(parameters) being specified. Thus, in the simplest case of a 
partial-equilibrium market for a single commodity, the two inde- 
pendent relations of supply and demand, each drawn up with other 
prices and institutional data being taken as given, determine by 
their intersection the equilibrium quantities of the unknown price 
and quantity sold. If no more than this could be said, the econo- 
mist would be truly vulnerable to the gibe that he is only a parrot 
taught to say “supply and demand.” Simply to know that there 
are efficacious “laws” determining equilibrium tells us nothing of 
the character of these laws. In order for the analysis to be useful 
it must provide information concerning the way in which our equi- 
librium quantities will change as a result of changes in the param- 
eters taken as independent data. 
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In the above illustration let us consider “tastes” as a changing 
parameter influencing only the demand curve. Will an increase in 
demand raise or lower price? Clearly the statement that, before 
and after the assumed change, price is determined by the inter- 
section of supply and demand gives us no answer to the problem. 
Nothing can be said concerning the movement of the intersection 
point of any two plane curves as one of them shifts. And yet most 
economists would argue that in a wide variety of circumstances 
this question can be given a definite answer — namely that price 
wiil increase. 

How is this conclusion derived? For few commodities have we 
detailed quantitative empirical information concerning the exact 
forms of fhe supply and demand curves even in the neighborhood 
of the equilibrium point. Not only would large amounts of time 
and money be necessary to get such information, but in many cases 
it is practically impossible to derive useful empirical information 
concerning what would happen if virtual changes in price con- 
fronted the demanders or the suppliers. 

This is a typical problem confronting the economist: in the 
absence of precise quantitative data he must infer analytically the 
qualitative direction of movement of a complex system. What 
little success he has hitherto achieved can be classified in large part 
under two headings: (1) theorems proceeding from the assumption 
of maximizing behavior on the part of firms or individuals, and 
(2) stability conditions relating to the interaction between economic 
units. Although inadequately explored until comparatively re- 
cently, the first type of conditions is best known and will not be 
dealt with here except incidentally. As will become evident later, 
however, from certain points of view they can be fitted in as special 
cases of the second set. It is the central task of this chapter to 
show how the problem of stability of equilibrium is intimately tied 
up with the problem of deriving fruitful theorems in comparative 
statics. This duality constitutes what I have called the corre- 
spondence principle. 

Comparative Statics 

The problem may be approached in full generality by consider- 
ing n unknown variables (xi, ■ • • , x n ) whose equilibrium values are 
to be determined for preassigned values of a parameter, a. We 
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assume n independent and consistent, continuously differentiable 
implicit relations involving some or all of the unknowns and the 
parameter a ; or 

f(xu * » Xn} ” 0 . (i — 1 , ■ * b , fi) ( 1 ) 

These determine a set of equilibrium values 1 

Xi° = g;(a). ( 2 ) 

We wish to determine the sign of 

fr “ *'<“>• < 3 > 

Differentiating (1) totally with respect to a, we can express this as 


dxf 

da 


£ fJ^i 

i — l 


( 4 ) 


where the subscripts indicate partial differentiation. 


= !//!> 



/ 1 1 

/ 2 1 

/n 1 

A = 

/ 1 2 

/ 2 2 

/n 2 


fi n 

/ 2" 

••• u 


and A,-i is the cofactor of the element in the jth row and ith. column 

of A. 

Unless some a priori restrictions are placed upon the nature of 
the elements involved in these determinants, no useful theorems 
can be derived. Each unknown derivative depends upon an 
n(n + 1) infinity of possible values. If the various determinants 
were expanded out, a sum of n ! terms would appear in the denomi- 
nator and in the numerator. Regarded simply as chance drawings 
taken at random from a hat, the probability that the signs of these 
would all agree would go rapidly to zero as the number of variables 
increased. Fortunately, as will be shown, the analysis of the 
stability of equilibrium will aid in evaluating these complicated 
expressions. 

1 If for a given value of a — ai, there exists a solution (xi°, * * * , x n °) t and if the 
matrix /dxf] is of rank n in a neighborhood of (s 0 )» then by the implicit-function 
theorem equations (2) represent single- valued continuously differentiable functions in 
a sufficiently small neighborhood of (on, oc°). 
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In the simple example of supply and demand alluded to above 
our unknowns are (p, q) , and our equilibrium system can be written : 


| q - D(p, a) = 0, {D a > 0 ,D P < 0) 
i q - S&) = 0, 


where a is a parameter of shift representing “taste” and D p is 
usually assumed to be less than zero. Also 


dp> 

da 

dq° 

da 


= z>« 1 

■ L/ “ Q 0 _ T) 9 J 

KJjp JLS p 

(6) 

Sr , 0 


— d 0 p 

“ S p ° - D p ° 

(7) 


Whether or not price will increase when demand increases is 
seen to depend upon the algebraic difference between the slopes (re- 
ferred to the price axis) of the demand and supply curves at the 
equilibrium point. Quantity will increase only if the slope of the 
supply curve is of the same sign as this algebraic difference. If the 
system is stable in the sense of Walras, it can be shown that the 
supply curve must have a greater algebraic slope than the demand 
curve so that price will necessarily increase ; the change in quantity 
is necessarily of ambiguous sign depending upon whether the supply 
curve is positively inclined jMr*so-called “backward-rising.” 2 


Stability and Dynamics 

Before deriving explicitly the Walrasian stability conditions re- 
ferred to above, I turn to a discussion of the meaning of a stable 
equilibrium. This will be found to presuppose a theory of dy- 
namics, namely a theory which determines the behavior through 
time of all variables from arbitrary initial conditions. If we have 
given n variables (jri(J), • • •, #*(/)][, and n functional equations of 

2 The distinction suggested by Mr. Kahn between “forward-falling” and “backward- 
rising” negatively inclined supply curves while suggestive does not rest upon a dynamic 
analysis of the attainment of equilibrium and so does not adequately come to grips with 
the problem in all its complexity. Cf. R. F. Kahn, “The Elasticity of Substitution and 
the Relative Share of a Factor,” Review of Economic Studies , I (1933), 72-78; also N. 
Kaldor, “A Classifactory Note on the Determinateness of Equilibrium,” Review of 
Economic Studies , I (1933), 122-136. 

The suggestion of Professor Viner that the latter type of curve gives the maximum 
amount forthcoming at a given price while the former does not will receive amplification 
in the course of this argument. 
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the general form 

■F’OiO), x 2 (r), • • x n (r), t] = 0, (* = 1, • • •, n) (8) 

— ao — 00 — 00 

then their behavior is determined once certain initial conditions are 
specified. 3 Examples of functional equation systems are given by 
sets of differential, difference, mixed differential-difference, integral, 
integro-differential, and still more general systems. Following the 
excellent terminology of Professor Frisch, 4 stationary or equilibrium 
values of the variables are given by the set of constants ( xf , , x n °) 
which satisfy these equations identically, or 

f^Oi 0 , x 2 °, • • ■ , x„°, t} = 0. 5 (i = 1, •■*,»), (9) 

— 00 —00 —00 

If the system has always been in equilibrium up until time t°, it 
will subsequently continue in equilibrium. However, the equi- 
librium values (xF, • • • , x n °) may be attained or even be maintained 
for a finite period of time, and yet because of generalized dynamical 
“inertia” it need not (and in general will not) remain in equilibrium 
subsequently, but may well “overshoot” the mark. 

The equilibrium position possesses perfect stability of the first 
kind if from any initial conditions all the variables approach their 
equilibrium values in the limit as time becomes infinite; i.e., if 

lim Xi(t) = Xi°, (10) 

t—*co 

regardless of the initial conditions. Alternatively, it is sometimes 
stated that an equilibrium is stable if a displacement from equi- 

3 What constitute initial conditions depends upon the nature of the functional equa- 
tions. For differential systems only values of the coordinates, velocities, and higher 
derivatives at some initial time need be specified. For difference equations defined only 
for integral values of t the same is true, where differences replace derivatives. In the 
general case values of the variables over a continuous time interval, possibly stretching 
back to — co, are required to constitute a complete set of initial conditions. 

4 R. Frisch, “On the Notion of Equilibrium and Disequilibrium/’ Review of Economic 
Studies , III (1936), 100-105. 

5 Of course, such a set need not exist. Thus, the simple system 



has no stationary equilibrium values since e? — x = 0 has no real roots. Similarly, 
dx/dt — 1 defines no stationary equilibrium position. 
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librium is followed by a return to equilibrium. A displacement is 
equivalent to an arbitrary change in the initial conditions and is 
possible only if some of our functional equations are momentarily 
relaxed or if our system is enlarged to include impressed forces 
or shocks. 

Stability of the first kind in the small exists if for sufficiently 
small displacements the equilibrium is stable. Stability in the 
small is contained within perfect stability but not vice versa. A 
system may be stable for small finite displacements but not for 
large displacements. Nevertheless, stability in the small is a 
necessary condition for perfect stability and will be analyzed here 
in greatest detail. 

It shpuld be pointed out that no conservative dynamical system 
of the type met in theoretical physics possesses stability of the first 
kind. If one displaces a frictionless pendulum, it will oscillate 
endlessly around the position of stable equilibrium . 6 Its motion 
is bounded, however, and it never remains on one side of the equi- 
librium position for more than a finite time interval. Such be- 
havior may be characterized as stability of the second kind or as 
stability in the second sense. As before, a distinction can be made 
between stability of the second kind in the small and complete 
stability of the second kind. For the most part in the present 
investigation I shall be concerned with the problem of stability of 
the first kind. 

The equations of comparative statics are then a special case of 
the general dynamic analysis. They can indeed be discussed ab- 
stracting completely from dynamical analysis. In the history of 
mechanics, the theory of statics was developed before the dy- 
namical problem was even formulated. But the problem of sta- 
bility of equilibrium cannot be discussed except with reference to 
dynamical considerations, however implicit and rudimentary . 7 
We find ourselves confronted with this paradox: in order for the 
comparative-statics analysis to yield fruitful results, we must first 

6 A dynamical system into which friction is introduced via a dissipation function 
may enjoy stability of the first kind. On these and kindred matters see G. D. Birkhoff, 
Dynamical Systems (New York, 1927). 

7 This is seen to be involved in the virtual-work analysis and in the minimum- 
potential-energy condition characteristic of a stable statical (“stationary”) equilibrium 
position. 


STABILITY AND COMPARATIVE STATICS 263 

develop a theory of dynamics. 8 This is completely aside from the 
other uses of dynamic analysis as in the studies of fluctuations, 
trends, etc. I turn now to some illustrations of these propositions. 

I. In the literary explanations of the process by which supply 
and demand are equated, the assumption is usually made that if 
at any price demand exceeds supply, price will rise ; if supply ex- 
ceeds demand, price will fall. Let us state this more precisely as 
follows: 

v = % = h( & d - 2s) = #[£>(£, «) - S(p)l, (11) 

where H( 0) = 0, and H r > 0. 

In the neighborhood of the equilibrium point this can be ex- 
panded in the form 

V = \{D P ° - S P °)(p - p°) -4 , (12) 

where X = (H')° > 0, and where terms involving higher powers of 
(p — p°) are omitted. The solution of this simple differential 
equation for initial price p at time zero can be written at sight : 

p(t) = p° + ip - p°)e HD, ‘~ Sr ‘ )t . (13) 

If the equilibrium is to be stable, 

lim pit) = p°. (14) 

t — *00 

This is possible if, and only if, 

D p ° - S p ° ^ 0. (15) 

If in what follows we rule out neutral equilibrium in the large and 
in the small, the equality sign may be omitted so that 

D p ° - S p ° < 0. (16) 

If the supply curve is positively inclined, this will be realized. If 
it is negatively inclined, it must be less steep (referred to the price 
axis) than the demand curve. If our stability conditions are 
realized, the problem originally proposed is answered. Price must 
rise when demand increases. 

8 The point made here is not to be confused with the commonplace criticism of 
comparative statics that it does not do what it is not aimed to do, namely describe 
the transition paths between equilibria. 
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II. These so-called Walrasian stability conditions are not neces- 
sarily the only ones. 9 If alternative dynamic models are postu- 
lated, completely different conditions are deduced, which in turn 
lead to alternative theorems in comparative statics. 

Thus, in Marshall’s long-run theory of normal price the quan- 
tity supplied is assumed to adjust itself comparatively slowly. If 
“demand price” exceeds “supply price,” the quantity supplied will 
increase. Preserving our notation of equations (5), remembering 
that quantity rather than price is regarded as the independent 
variable, and neglecting higher-order terms, we have the following 
differential equation 



whose solution is 

q(t) = g° + (2 — 

If the equilibrium is to be stable, 

J L = 1 ~ Dp 0 0 

D p ° S v ° Dp 0 S v ° ’ 


(17) 

(18) 

(19) 


i.e., the demand-curve slope referred to the quantity axis is less 
algebraically than that of the supply curve. Since the demand 
curve is negatively inclined, 



( 20 ) 


Referring back to equations (7), we see that Marshallian stability 
conditions require that quantity increases when demand increases 
in every case, while the change in price is necessarily ambiguous 
depending upon the algebraic sign of the supply curve’s slope. 

It is to be pointed out that this forward-falling supply curve is 
not a true supply curve in the sense of the amount forthcoming at 
each hypothetical price, although it is a true supply curve in the 
sense of being the locus of price-quantity points traced out by 
fluctuations in sufficiently steep demand curves. As such it is a 
reversible long-run relation. 

9 A historical error is involved in the identification of the above stability conditions 
with Walras in alleged contrast to those of Marshall which are shortly to be discussed. 
Actually as far back as in the Pure Theory of Foreign Trade Marshall defined stable 
equilibrium, in which a so-called backward rising supply curve was involved, much like 
the Walrasian case. 
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III. Still another dynamic model may be considered. It has 
been held that for some commodities supply reacts to price only 
after a given time lag, while price adjusts itself almost instantane- 
ously. This leads to the familiar cobweb phenomenon. Using the 
same notation, our dynamic model takes the form of the following 
difference equations, 


2« — S(p t - 1 ), 
q t = D{p t , a). 

(21) 

In the neighborhood of equilibrium 


(q t - go) = (§ 4 ) (2i-i - 2°) 

(22) 

with the solution 


2< = 2° + (2-2 °)(§4)‘- 

(23) 

Stability requires that 


C 0 

p <1 

D P ° ^ U 

(24) 


If the supply curve is positively inclined, it must be steeper abso- 
lutely than is the demand curve, reference being made to the 
quantity axis. In this case the approach to equilibrium is of a 
damped oscillatory nature, every other observation being on one 
side of the equilibrium value. 

If the supply curve is negatively inclined, it must be steeper 
referred to the quantity axis than is the demand curve, precisely 
as in the case of Walrasian stability. The approach to equilibrium 
is asymptotic. As in the Walrasian case, we can deduce the 
theorem in comparative statics that price will necessarily increase 
even though the change in quantity is indefinite. 

It is to be noted that a first-order difference equation is richer in 
solution that the corresponding first-order differential equation. 
Not only does it admit of oscillatory solutions, but the stability 
conditions relate to the absolute value of the root of an equation, 
implying two distinct inequalities. Remembering that D p ° is nega- 
tive, the inequality of (24) can be written 

D p ° < S P ° < - Dp 0 . 

The new inequality tells us that any increase in output as a result 
of an increase in demand cannot be so large as the increase in output 
resulting from an “equivalent” increase in supply. 


266 FOUNDATIONS OF ECONOMIC ANALYSIS 

IV. Still a fourth dynamical model that has been considered is 
that of Marshall in the Pure Theory of Foreign Trade. Let Figure 2 
represent the familiar offer curves of two trading bodies (suppliers 
and demanders) respectively. Equilibrium is attained at the in- 
tersection (not necessarily unique) of two such curves. If equi- 



librium is displaced, country I is to act in such a way as to change 
the amount of x 2 in the horizontal direction of its offer curve 
(as indicated by the pointed horizontal arrows). Similarly coun- 
try II adjusts x 2 vertically in the direction of its offer curve. 
Mathematically, 

in = Hi[G(x 2 ) - x J, 

= H 2 [_F(xi) — x 2 ], 


(25) 


where FI/ > 0, H»( 0) = 0, and G(xf) — x 2 = 0, F(x i) — x 2 = 0 
represent the statical offer curves of countries I and II respectively. 
If units are properly chosen, the following system of differential 
equations will hold in the neighborhood of equilibrium, 

Xl — — (Xi - Xi 0 ) 4- (G')°(x 2 — x 2 °), 
x 2 = (F')°(x i — xi°) — (x 2 — x 2 °). 


(26) 


The solution takes the form: 


Xi(t) = x/ + k n e > ' lt + kne Xzt , 

x 2 (i) = x 2 ° + ^ 2 ie Xl * + ^ 22 e X2 ‘, 


(27) 


where the k’s depend upon the initial values (xi, x 2 ) and the X’s are 
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roots of the characteristic equation 

- 1 - X 

(F')° 

Clearly 


D(\) 


(G'Y 

• 1 - X 


(28) 


X = - 1 ± V(G0W- (29) 

The equilibrium will be stable if the real part of X is necessarily 
negative, or 

R(\) < 0. (30) 


If both ( G')° and (F')° are of opposite sign (e.g. , if one has an elastic 
demand, the other an inelastic demand), this condition will neces- 
sarily be satisfied. The solution will be oscillatory, but damped, 
approaching equilibrium in a spiral as shown above in Figure 3, 
and obeying an equation of the form 


Xi = Xi° + e~‘(a,- sin 6t + bi cos 6t). (i = 1, 2) (31) 

If both are positive, however (each with elastic demands), then 


V (G')°(F')° < 1, 

(32) 

(G')°(F')° < 1, 

(33) 

(«')• < XFy- 

(34) 


In terms of the slopes of both offer curves referred to the x\ axis 


/ dxi\ > ( dx£ \ 
\dx 1/1 ^ \dxijn 


(35) 


The equilibrium is approached asymptotically. 

If both curves are negatively inclined, stability requires 


/ dx{\ / dxj \ 

\dxi)i < Uxi /11 


(36) 


Clearly, the general condition when the curves are of like sign can 
be written 


doc 2 


dx 2 

dx 1 

> 

1 

dx 1 


(37) 


and the approach to equilibrium will in every case be asymptotic. 10 

10 Somewhat paradoxically, in this case positions of stable equilibrium need not.be 
separated by positions of unstable equilibrium because of the possibility of complex roots. 
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The stability conditions derived here will be found, if translated 
into terms of supply and demand curves rather than offer curves, 
to imply differing and inconsistent conditions from those of the 
preceding cases. 

V. In the four cases considered I have been concerned with 
problems of stability of the first kind. Following a suggestion of 
Dr. Francis Dresch of the University of California, let us suppose 
that price falls not when the instantaneous supply exceeds demand, 
but only when accumulated stocks exceed some normal value, Q°, or 

V = MQ° - Q) + • • • = IQ° - x f * (as - q D )dt, (X > 0) (38) 

Jo 

since the stock equals the accumulated difference between the 
amount produced and the amount consumed. Differentiating with 
respect to t, neglecting terms of higher power, and adhering to our 
previous notation, we have 

p = (ZV - S p °)(p - p°), (39) 

whose solution is 

pit) = p° + Cl e^ D >° ~ Sp ° )l + ae~^ Dp °~ St ° )l , (40) 

the c’s depending upon the initial price and price change. 

Only if 

Dp 0 - S p ° < 0, (41) 

can explosive behavior be avoided. If the above inequality is 
realized, however, the square root will be a pure imaginary number 
so that the solution takes the form of an undamped harmonic : 

pit) = bi cos ■ylSp 0 — D p ° t + bz sin V5/ — D p ° t -f- P°- (42) 

Thus, if we require our second-order differential equation to have 
at least stability of the second kind, we come to the same theorems 
in comparative statics as in case I. 

There is at least one serious objection to assuming a nondamped 
system of this kind. If there are superimposed upon our system 
random shocks or errors, these will tend to accumulate so that the 
expected amplitude of the cycles will increase with time. This is 
well illustrated by the familiar Brownian movement of large mole- 
cules under the impact of random collisions. The molecule “takes 
a random walk,” and its mean variance increases with the observa- 
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tion time . 11 Before adopting a similar hypothesis in economic 
analysis, some statistical evidence of its possible validity should 
be adduced . 12 

We have now surveyed five different dynamic setups and re- 
lated stability conditions, all referring to a simple one-commodity 
market. Except possibly for cases IV and V, all are mathemati- 
cally trivial. Unaided intuition or simple geometrical methods 
serve to reveal sufficient conditions for stability. They are of sig- 
nificance, however, because each has played an important part in 
the history of economic science ; and precisely because of their sim- 
plicity they provide a useful illustration of the general principle 
involved. In the following sections I shall be concerned with more 
complex problems. 


The Stability of Multiple Markets 

While it might be more elegant at this stage to develop formally 
for general systems the fundamental principles illustrated thus far, 
our foregoing discussion provides a very convenient opening for 
an examination of a problem which has received considerable at- 
tention lately at the hands of Professor Hicks. In Value and 


11 Random shocks are not necessarily to be regarded as a nuisance. In their absence 
friction might imprison the system at some fixed level other than the “true” equilibrium 
level (friction being disregarded). Often random shocks serve to insure the realization 
of average values nearly equal to the equilibrium ones, just as iron filings placed upon 
a piece of paper over a magnet assume the lines of force of the magnetic field when 
gently tapped. 

12 One can avoid an undamped system by assuming that price tends to fall not 
only when stocks are large, but also when current supply exceeds current demand; 
i.e., when stocks tend to accumulate. Then we have 

p - a <2° — (qs — qn)di J — (3(qs — qn) 
or 

p = a(iy ~ S p °)p + /3(ZV - S p °)p. (a, j3 > 0) 


The equilibrium is stable only if 
or if 
or if 


R(\) < 0, 

X 2 - a(D p ° - S p °)\ - i S(zy - 5/) = 0, 
D P ° - S p ° < 0. 


This agrees with the condition of case I and the one just derived. In fact, each of these 
is a special case when one of the coefficients vanishes. For intermediate values, the 
solutions range continuously between damped harmonic motion and exponential 
approach to equilibrium. 
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Capital, Chapter VIII and Mathematical Appendix, §21, he has at- 
tempted to generalize to any number of markets the stability con- 
ditions of a single market. The method of approach is postula- 
tional ; stability conditions are not deduced from a dynamic model 
except implicitly . 13 Propositions which are deduced here as 
theorems are assumed as definitions of stability. 

For a single market, according to Professor Hicks, equilibrium 
is stable if an increase in demand raises price. (This rules out in 
the beginning cases II and IV.) For multiple markets equilibrium 
is imperfectly stable if an increase in demand for a single good raises 
its price after all other prices have adjusted themselves; the equi- 
librium is perfectly stable if an increased demand for a good raises 
its price even when any subset of other prices is arbitrarily held 
constant (by means of a relaxation of other equilibrium conditions). 

To test the necessity or sufficiency of these criteria in terms of 
a more fundamental definition of stability of equilibrium let us 
make a natural generalization of the Walrasian conditions of the 
following form : the price of any good will fall if its supply exceeds its 
demand, these each being regarded as functions of all other prices. 

Mathematically, 

Pi = — Hfq s i — 

= - Hilqsfpl, ■ • • , pn) - qD i {pl, ■■ ■, pn)2 
= H' t ai f(Pj - pf) + • • : J , ( 43 ) 

3=1 

where 

0 = qsKpi, • • • , Pn ) - qofpi, • • * ,pn ) = - qfpi, •••,£») ( 44 ) 

represent statical equations of supply and demand, a if represents 
the partial derivative of q »• with respect to the jth price evaluated 
at the equilibrium set of prices. In general, a if 5 ^ a ,/. 14 It is 

13 It is true that on page 70 a hint of a dynamical process creeps into the discussion. 
The approach to equilibrium seems to be regarded as taking place in finite steps at dis- 
crete intervals of time; i.e,, in accordance with certain difference equations. Correctly 
stated, this argument would not lead to essentially different stability conditions from 
my system of differential equations discussed below, as the later general discussion 
will disclose. 

14 If the demand and supply were each drawn up with reference to firms maximizing 
profit, well-known integrability conditions would guarantee this equivalence. On the 
consumer’s side there need be no such equivalence, and if we consider a consumer whose 
total purchases balance his total sale of productive services, such an equality for every 
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instructive to consider first, however, the symmetrical case (such 
as characterizes markets made up] exclusively of entrepreneurs) 
where the speeds of adjustments, H/, are set equal to one. The 
solution of equations (43) can be written 


pm 


n 


= Pf + Z k i} e^\ 

i=i 


(45) 


where (Xj, • • • , \ n ) are latent roots of the characteristic equation 
and 



an 0 

— X 

* * O'ln 0 

m = 

a 2 i° 

a n 0 — X 

0*2 


dm 0 

a n 2° 

■ ■ a„n - X 

= 

\a — 

XI | = | dif — X5 ij | 

= o, 


(46) 


the k’s depend upon the matrix a and upon the initial conditions. 15 
As before, stability requires i?(Xy) < 0. 

By a well-known theorem of Hermitian matrices, in the sym- 
metrical case all the roots are necessarily real. If the equilibrium 
is to be stable, they must all be negative. According to a classical 
theorem, this is possible if and only if a is the matrix of a negative 
definite quadratic form; i.e., only if all principal minors alternate 
in sign as follows: 


I at/ 1 < 0; 


da 0 0*ij° 

a 3i ° a 3j ° 


> 0 ; 


au° 

O'if 

&ik° 


an 0 



< 0, 

aki° 

a k j° 

O'hh 0 




i 

j k i. (47) 


combination of goods and services would, strictly interpreted, lead to an absurdity; it 
would imply expenditure proportionality and, Hence, zero consumption of every good and 
zero offering of every service! For the general demand or supply function we need not 
expect a canceling off of “income effects” since individuals usually face firms in consump- 
tion and factor markets. 

15 If the roots are not distinct, polynomial terms of the form te Xt } £ 2 g Xt , * * * , £*e Xi 
appear where (s + 1) is the multiplicity of a repeated root. In any case the problem 
of stability depends only upon the Vs and is unaffected by such multipliers since the 
exponential always governs the asymptotic behavior of the solution whenever dampening 
does occur. 
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Any ratio of the form 


1 

0 

0 

0 

a 33° ■ ■ ■ 

a jk ° 

O 

a kj ° • * • 

dhk° 

au° 

dif ■ ■ ■ 

&ik° 

dji° 

dj? • ■ ■ 

a jh ° 

a H ° 

a k j° • • * 

&kk° 


( 48 ) 


is necessarily negative in sign. But such ratios are precisely equal 
to the change in the price of the ith. good with respect to a unit 
increase in its own supply when appropriate subsets of other prices 
are held Constant, so that for this case the stability criteria of Pro- 
fessor Hicks are seen to be correct theorems. 

Where perfect symmetry is not present (and in business-cycle 
analysis it is almost always absent), the Hicks criteria are not at 
all necessary conditions and in many cases not sufficient . 16 A 
system may possess stability of the first kind even though neither 
perfectly nor imperfectly stable in Hicks’s sense. I long suspected 
that perfect stability is a sufficient condition for stability of the 
first kind. But this conjecture proved to be false. Perfect sta- 
bility, like imperfect stability, is neither a necessary nor sufficient 

16 A word of warning may be in order concerning the indiscriminate use of either 
prices or quantities as independent variables. This leads to contradictory definitions of 
complementarity in the literary discussion on page 44 and the mathematical definitions 
on page 311, the inconsistency between which can lead to opposite signs. Such an 
interchange of independent variables (as between the literary definition of stability on 
page 63 and the mathematical conditions on 315 and 325) is particularly important 
where nonsymmetrical matrices are concerned. Does 


imply 


dxi 

dpi 


dpi 

dxi 


< 0 , 


< 0 , 


dxi 

dxi 

dp% 

dpj 

dXj 

dxj 

dpi 

dpi 

dpi 

dpi 

dxi 

dxj 

dpi 

dpi_ 

dxi 

dxj 


> 0 , 


where 


dxi dxj 
dpi ^ dpi ' 


> 0 , 


. . .? 


The answer is yes, but the proof is not simple. Even with symmetry the product 
(dpi/dxj) (dxi/dpj) need not be of positive sign if more than two variables are involved. 
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condition. 17 In any case it is too strict a condition, while the re- 
quirement of imperfect stability is not strict enough; only in the 
case of symmetry do these limits converge. Why any system 
should be expected to possess perfect stability, or why an economist 
should be interested in this property is by no means clear. Not 
working with an explicit dynamical model, Professor Hicks prob- 
ably argued by analogy from well-known maximum conditions, 
whereby a maximum must hold for arbitrary displacements and 
through any transformation of variables. As a result, some vari- 
ables may be made constants, and with respect to the remaining 
arbitrary subsets the definiteness of various quadratic forms must 
be insured. On the other hand, in terms of a truly dynamic process 
the equilibrium must be stable for arbitrary initial conditions or 
displacements and for arbitrary nonsingular transformations of 
variables, but not necessarily for arbitrary modifications of the 
dynamic equations of motion such as are involved in the Hicks 
procedure of holding subsets of other prices constant (by violating 
or relaxing true dynamical relations). In principle the Hicks pro- 
cedure is clearly wrong, although in some empirical cases it may be 
useful to make the hypothesis that the equilibrium is stable even 
without the “equilibrating” action of some variable which may be 
arbitrarily held constant. (In connection with the Keynesian 
model later discussed, an example of this is presented.) 

To summarize: for every case true necessary and sufficient 
stability conditions are that i?(\y) < 0, where \> represents the 


17 The matrix 


" € 1 0 0 ' 

0 € 1 0 

0 0 € i 

+1 -1 1 + 6 . 


for sufficiently small e’s has all principal minors positive and yet has some roots whose 
real parts are negative. This shows that even Hicksian perfect stability does not 
guarantee dynamic stability. See P. A. Samuelson, “The Relation between Hicksian 
Stability and True Dynamic Stability,” Econometrica , XII (1944), 256-257, Also, as 
Metzler has recently shown, the Hicksian conditions are necessary (but not sufficient) if 
the system is to be stable for all possible (positive) rates of adjustment in different 
markets; and if all off-diagonal terms are non-negative, the Hicks conditions are both 
necessary and sufficient for stability. L. A. Metzler, “Stability of Multiple Markets: 
The Hicks Conditions,” Econometrica , XIII (1945), 272— 292. For a proof that the 
inverse of the last matrix must consist of all elements of the same sign, see j. Mosak, 
General Equilibrium Theory in International Trade (Bloomington, Indiana: Principia 
Press, 1944), p. 49. 
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latent roots of the matrix a. This is not equivalent to the Hicks 
conditions. 18 

In an address before the Econometric Society Professor Lange 19 
suggested that the speed of reaction might be different in every 
market so that instead of writing 

Pi = H'q t + ■■■ (49) 

as in (43), we write 

Pi = Hiqi + • ■ • , (50) 

where Hi = H/( 0) is the (positive) coefficient appropriate to the 
tth market. 

Even this is not sufficiently general unless a particular com- 
modity classification is to be favored, and if our formulation is to 
be invariant under linear transformations of commodities and 
prices. We have no choice but to admit that the rate of adjust-' 
ment in one market may depend on the excess demand in other 
markets; so that in matrix terms 

p = Hq+ ••• = Hap + •••, (51) 


where q is the column of ql s ; a the square matrix a if ; and where H 
need not now be a positive diagonal matrix. It would seem 
reasonable at first to require H to be positive definite, or at least to 
be positive quasi-definite, as the latter expression was defined in 
chapter vi. But if we apply the contragredient transformation c 
as given in equation (47) of chapter vi, 


1 = cq; q = c~ l q 

p = c ,-1 p ; p = c'p, 


(52) 


18 The following illustrations bear this out: The system 

Pi = - 2 Pi + 4 £ 2 , 
p2 ~ — pl p2, 

possesses stability of the first kind, but is neither perfectly nor imperfectly stable. 
The system 

Pl = pl — p2, 
p2 — 2pl -f- p2y 

is imperfectly stable, but departs ever further from equilibrium. 

19 O. Lange, Abstract of “The Stability of Economic Equilibrium,’ ' Econometrica , 
X (1942), 176-177; Price Flexibility and Employment (Bloomington, Indiana: Principia 
Press, 1944), appendix. 


i. 
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then it can be verified that 

p — Rq — • = c'Hq Rap -| — • = (c'Hac'~ l )P -f- • • • . ( 53 ) 

It follows that our restrictions cannot be placed upon H alone, 
but only upon the product of H and a ; that is, the results depend 
upon the statical matrix a and upon the dynamical reaction-speed 
matrix H. It is sufficient for stability that in some coordinate 
system H~ 1 be positive quasi-definite and a negative quasi-definite, 
but these conditions are by no means necessary. 

Before leaving the problem of stable multiple markets, I should 
like to sketch the effect of the introduction of stocks and its rele- 
vance to stability of the second kind. Let price fall not when cur- 
rent supply exceeds current demand, but when existing stocks (ac- 
cumulated over time from the divergence of current production and 
consumption) exceed an equilibrium amount. Then neglecting 
terms of higher powers, we write 

Pi = Qi° ~ f ( qs ~ qD)dr = Qi° + f E a if (Pi - Pf)dr + • • • 

0 Jo 3=1 

or 

pi = Eo>/to — Pf) +■•••. (54) 

whose solution takes the form 

pm = pi° + E + hue-**), (55) 

i=l 

where = 0, and where for unrepeated roots the k’s 

and the h’s are constant depending upon initial conditions. Clearly 
the motion will be explosive and undamped unless Vky are all pure 
imaginary numbers; i.e., unless is real and negative. 

If the system is symmetrical, this clearly leads to the same 
conditions as those for stability of the first kind. If not sym- 
metrical, the substitution of second derivatives everywhere for first 
derivatives (through the hypothesis of dependence upon accumu- 
lated stocks rather than instantaneous flows) implies more rigid 
conditions upon the coefficients to insure stability of the second 
kind than were previously required to insure stability of the first 
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kind. This is of course because of the requirement that the roots 
be real as well as negative. 20 

Analysis of the Keynesian System 

Up until now I have considered examples drawn from the field 
of economic theory. The techniques used there are of even more 
fruitful applicability to problems of business cycles. To illustrate 
this I shall analyze in some detail the simple Keynesian model as 
outlined in the General Theory. Various writers, such as Meade, 
Hicks, and Lange, have developed explicitly in mathematical form 
the meaning of the Keynesian system. 21 The three fundamental 
relationships stressed by Keynes are (1) the consumption function 
relating* consumption (and hence savings-investment) to income, 
and for generality to the interest rate as well; (2) the marginal 
efficiency of capital relating net investment to the interest rate and 
to the level of income (as of a fixed level of capital equipment, fixed 
for the short period under investigation) ; (3) the schedule of liquidity 
preference relating the existing amount of money to the interest 
rate and the level of income. 

Mathematically, these may be written as follows: 

C(i, 7) -Y + I = - a, (56) 

F(i, Y) - I = - (3, (57) 

m, Y) = M, (58) 

where i, Y, I stand respectively for the interest rate, income, and 
investment; C, F, L stand respectively for the consumption func- 
tion, the marginal-efficiency-of-capital schedule, and the schedule 
of liquidity preference. M stands for the existing amount of 
money, taken as a parameter; a is a general parameter representing 

20 One could consider tlie generalization of the intermediate hypothesis of footnote 
12 where price change depends upon stocks and flows, namely, 

Pi = 12 OifLapj + 13 (Pi - pj°)l- ( ^ > 0 ) 

If stable for > 0, a = 0, and also for /3 « 0; a > 0, it can perhaps be proved to be 
stable for all intermediate cases. 

21 J. E. Meade, “A Simplified Model of Mr. Keynes’ System,” Review of Economic 
Studies , IV (1937), 98-107; J. R. Hicks, “Mr. Keynes and the “Classics’; A Suggested 
Interpretation,” Econometrica , V (1937), 147-159; Oskar Lange, “The Rate of Interest 
and the Optimum Propensity to Consume,” Economica , V (1938), 12-32. 
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an upward shift in the propensity-to-consume schedule; similarly 
as the parameter P increases, the marginal-efficiency schedule shifts 
upward. 

We have three relations to determine the three unknowns in 
terms of three parameters, viz. : 

i = i(a, P, M), 

Y = Y(a, 13, M), (59) 

I = I(a, 13, M). 

As explained in the first section of this paper, the usefulness of the 
Keynesian equilibrium system lies in the light it throws upon the 
way our unknowns will change as a result of changes in data. 
More specifically, what are the signs of 


di 

dY 

di 

!£’ 

da ’ 

da ’ 

di 

dY 

di 

W ’ 

dp ’ 

dp ’ 

di 

dY 

dM ' 

dM’ 

dM 1 


Differentiating totally with respect to our parameters and evaluat- 
ing the resulting linear equations, we find 


di 

— Ly 

dY _ 

Li 

di _ 

FyLi FiLy 

da 

A ’ 

da 

A ’ 

da 

A 

di 

— Ly 

dY _ 

Li 

di _ 

(1 — Cy)Li -f- CiL 

d& 

A ’ 

dp ~~ 

A ’ 

dp — 

A 


di 1 - Cy- F r dY Fi + Ci 
dM~ A 1 dM~ A ’ 

dl_ Fy(Fj + Ci) + (1 - Cy - Fy)Fi 
dM A 

where 


= Ly{Fi + Ci) H- Li( 1 -Cr- Fy). (61) 


Ci Cy - 1 1 

A = Fi Fy - 1 
Li Ly 0 

On the basis of a priori, intuitive, empirical experience the 
following assumptions are usually made: 

Cy > 0, Fy> 0, Fi < 0, Ly > 0, Li < 0, (62) 
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while 

C t = 0 

and is usually assumed in modem discussions to be of minor quan- 
titative importance. 

In order to evaluate our nine derivatives we must be able to 
determine unambiguously the signs of all numerators as well as the 
common denominator, A. A consists of five terms, two of which 
are of positive sign, two of negative sign, and one ambiguous. On 
the basis of deductive analysis along strictly statical lines nothing 
can be inferred concerning its sign. Moreover, even if the sign of 
A were determined, all but four of the nine would be found to have 
numerators of indeterminable sign. 

This- is a typical case. If we are to derive useful theorems, we 
must clearly proceed to a consideration of a more general dynamic 
system which includes the stationary Keynesian analysis as a spe- 
cial case. This can be done in a variety of alternative ways. I 
shall consider two, the first of which is based upon a differential 
system and yields quite definite results. 

Case 1. Let us assume as before that the second and third 
relations of marginal efficiency and liquidity preference work them- 
selves out in so short a time that they can be regarded as holding 
instantaneously. Let us assume, however, that I now represents 
“intended” investment, and this magnitude equals savings-in vest- 
ment only in equilibrium, i.e., when all the variables take on sta- 
tionary values. If, however, because of some change, consumption 
(say) should suddenly increase, national income not having a 
chance to change, actual savings-investment would fall short of 
“intended” investment because of inventory reduction, etc. Con- 
sequently, income would tend to rise. Similarly an excess of actual 
savings-investment over intended investment would tend to make 
income fall. Mathematically, this hypothesis may be stated as 
follows: the rate of change of income is proportional to the difference 
between intended savings-investment and actual savings-investment. 
The discussion here is unrelated to the controversy over the equal- 
ity of savings and investment despite possible appearances to the 
contrary. The superficial resemblance between my formulation 
and the Robertsonian identities whereby the difference between 
investment and savings is the time difference of income should not 
mislead the careful reader. 
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Equations (56), (57), and (58) are replaced by the dynamical 
ones: 

f = I - [F - C(i, Y) - (63) 

0 = F(i, Y) -1 + 0, (64) 

0 = L(i, Y) - M. (65) 

The solution of these is of the form : 


where 


Y = Y° + a t e+ 
i — i° + a 2 e X( , 
I = 1° + 


A(X) = 


Q 

Li 


Cy — 1 — X 

Ft 

Ly 



= A + XZ,,- = 0.* 


( 66 ) 


(67) 


The equilibrium is stable only if 



x = - A < 0 . 

(68) 

But Li < 0; therefore, 

A <0 

(69) 

unambiguously. 




This establishes four theorems: an increased marginal efficiency 
of capital will (1) raise interest rates and (2) raise income; an 
increased propensity to consume will (3) raise interest rates and 
(4) raise income. But how will the creation of new money affect 
interest rates? This can be answered by considering more string- 
ent stability conditions. Let us suppose that the interest rate 
were kept constant (say) by appropriate central bank action. 
This assumption is equivalent to dropping the liquidity preference 
equation (65) and treating i as a constant in the remaining equa- 
tions. If the equilibrium is stable under these conditions, we 
must have 


Cy - 1 - X 
Fy 


— 0 — (1 — Cy — Fy) "4* X, 


(70) 


or 

- X = (1 - Fy - Cy) > 0. (71) 

This leads to another important theorem: (5) the marginal pro- 
pensity to consume plus the marginal propensity to invest cannot 
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exceed unity or the system will be unstable (as of a fixed interest 
rate). 22 It also tells us (6) that an increase in the amount of 
money must, ceteris paribus , lower interest rates. 

We are left with four ambiguities of sign. Two of them depend 
upon the fact that savings may vary in any direction with respect 
to a change in interest rates. If we assume that normally savings 
out of a given income increase with the interest rate, or, if they do 
decrease, do so not so much as does investment, then three more 
theorems become true: an increase in the amount of money (7) 
increases income and (8) increases investment; (9) an increase in 
the marginal-efficiency schedule increases investment. There re- 
mains a final ambiguous term. What is the effect upon investment 
of an increased propensity to consume? This is seen to be essen- 
tially ambiguous depending upon the quantitative strengths of the 
liquidity-preference slopes and the marginal-efficiency slopes. As 
income increases, money becomes tight because of the need for 
financing more transactions. This tends to depress investment. 
As an offset, the increase in income tends to increase investment 
through the marginal propensity to invest. Which effect will be 
the stronger cannot be decided on a priori grounds. 

I have prepared a 3 X 3 classification indicating the signs of 
the nine terms. All but four have definite signs. Of these four, 
one is essentially ambiguous as indicated by a question mark. The 
remaining three show under question marks their normal, pre- 
sumptive signs. 


Increase in propensity to consume 
Increase in marginal efficiency of capital 
Increase in amount of money 

n If we take investment also as an independent parameter (say through government 
action), we lose equation (57) and have for stability the condition 

i Cr — 1 -X| « (Cr - 1) - X =0, 

X « Cy - 1 < 0, 

or that the marginal propensity to consume must be less than one . But this is weaker than 
the previous condition in view of the fact that the marginal propensity to invest is 
assumed positive. 



i 

Y 

I 

a 

+ 

+ 

? 

|8 
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+ 
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+ 
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Case 2. I now turn to a system based on a difference equation. 
It is founded upon considerations similar to those underlying the 
Kahn-Clark multiplier block diagrams, and for this reason alone 
is worth consideration. In addition, the analytical contrasts be- 
tween differential and difference systems is brought out. Revers- 
ing the order of the previous exposition, let us take investment as 
an independent parameter and the interest rate as a constant. Let 
consumption be a given function of income during the preceding 
period of time: 

C t = C(l Y t -i) = (72) 

What properties must this function satisfy if the equilibrium is 
to be stable? Income clearly equals consumption plus investment: 

Y t = C t + It. ’ (73) 

Recalling that investment is treated as a constant, I, and using 
the consumption relation, we find 

Yt = C(Yt-i) + I, (74) 

or, to a first approximation, 

(Y t - Y°) = CyHY t ^ - P), (75) 

where 

P = C(Y°) + I (76) 

is the equilibrium level of income for investment equal to I. 

The solution of this difference equation takes the form 

Yt = P + K(Cr°y (77) 

and is stable only if 

| Cy° | < 1 (78) 

or 

- 1 < Cr° < 1." (79) 

While the marginal propensity to consume is usually assumed to 
be positive, it need not be so, and still the equilibrium can be a 

23 This inequality is in effect the formal justification of Keynes’ reply to those 
criticizing his fundamental law, that the burden of proof lay upon them to explain 
why, if their allegations were correct, the economic system was not hopelessly unstable. 
See the passages quoted from a letter of Keynes in E. W. Gilboy, ‘‘The Propensity to 
Consume: Reply,” Quarterly Journal of Economics, LIII (1939), 634. While funda- 
mentally correct, Keynes does overlook the possibility of other stabilizers such as the 
marginal propensity to invest, interest rate, etc. 
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stable one. Even if it lies between zero and minus one, it is inter- 
esting to observe that the “multiplier” is positive since 


dY° _ 1 

dl ~ 1 - C Y ° 


> 0 , 


(80) 


but less than unity because of negative “secondary” effects. 

Let us now drop the assumption that investment is a datum, 
although keeping the interest rate constant. Our dynamic system 
is of the form 



C(f, Yt-i) 

I - Y t + It = 

o, 

(81) 


F{t, Y t ) - 

■it = 

o, 

(82) 

and the equilibrium is stable 

only if 





N- 

Cy 

1 - F y 

< i, 


(83) 

or 







-I1--FVI 

< Cy < 

:|i- 

Fy |. 

(84) 


Now if the marginal propensity to invest is less than unity 
(1 — Fy > 0), 24 this leads to essentially the same stability condi- 
tions as before, namely, the marginal propensity to consume 
plus the marginal propensity to invest must be less than unity 
( Cy + Fy < 1). But, and this is paradoxical, if the marginal 
propensity to invest is sufficiently large, i.e., greater than +2, the 
marginal propensity to consume may exceed unity, and yet the 
equilibrium will be stable! Moreover, beyond a certain critical 
value the larger the marginal propensity to invest, the more stable 
is the system. 

If we now consider the system in which none of the variables 
are taken as given, namely, 

C(i t , F*_i) - Y t + I t = 0, 

F(i t , Y t ) - It = 0, (85) 

L(it, Y t ) — M — 0, 

84 In the marginal-efficiency relation I have made investment depend upon income, 
which itself includes investment. Other writers, notably Lange {op. cit.), have made 
it depend only upon consumption. The result is indifferent since they can be shown 
to be equivalent. If, however, it is assumed that dl/dC > 0, the marginal propensity 
to invest, dl/dY = {dI/dC)/[l -f (dI/dC)2, cannot exceed unity. If a one-period lag 
is put in (82), then the sum of the consumption and investment propensities must be 
less than one. 
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stability requires that 


N = 


-£>tCy 

A -j- UC Y 


< 1 . 


( 86 ) 


In what may be termed the normal case, where the marginal 
propensity to invest is less than unity, this requires as before that 

A < 0, (87) 


and immediately all the eight determinations of sign of Case I 
become correct. 

In the unusual, but possible, case where 


1 - F y < 0 < C T < (F y - 1) 


U 


C Pi + Ci), (88) 


the equilibrium will be stable, but the signs of our 3X3 table now 
are as follows: 



i 

F 

I 

a 

— 

— 

? 

13 

— 

— 

? 

M 

— 

? 

? 


In words, the only theorem which remains true under all circum- 
stances is that an increase in the amount of money must lower 
interest rates if the equilibrium is stable. 

This example illustrates the additional complexities which sys- 
tems based upon difference equations involve. Later some of the 
reasons for this will be explained. 

The examples here adduced serve, I hope, to illustrate the light 
which dynamical analysis sheds upon comparative statics. Prob- 
lems in theory and business cycles of any complexity will almost 
surely require similar analytic treatment if useful and meaningful 
theorems are to be derived. 



CHAPTER X 


THE STABILITY OF EQUILIBRIUM: LINEAR 
AND NONLINEAR SYSTEMS 

Introduction 

In the previous chapter it was pointed out that there exists an 
intimate formal dependence between comparative statics and dy- 
namics. To my knowledge this had not previously been explicitly 
enunciated in the economic literature, and for lack of a better name 

r 

I shall refer to it as the Correspondence Principle. It is the purpose 
here to probe more deeply into its analytical character, and also to 
show its two-way nature: not only can the investigation of the dy- 
namic stability of a system yield fruitful theorems in statical 
: analysis, but also known properties of a (comparative) statical 

system can be utilized to derive information concerning the dy- 
namic properties of a system. 

An understanding of this principle is all the more important at 
a time when pure economic theory has undergone a revolution of 
i! thought — from statical to dynamical modes. While many earlier 

' foreshadowings can be found in the literature, we may date this 

upheaval from the publication of Ragnar Frisch’s Cassel Volume 
essay of only a decade ago . 1 The resulting change in outlook can 
be compared to that of the transition from classical to quantum 
mechanics. And just as in the field of physics it was well that the 
relationship between the old and the new theories could be in part 
clarified, so in our field a similar investigation seems in order. 

Before entering, however, upon these unavoidably technical 
matters, a few obiter dicta concerning the fundamental differences 
between statical and dynamical systems may be in order. Con- 
ceived broadly, a dynamical system might be regarded as any set 
of functional equations which together with initial conditions (in 
the most general sense) determine as solutions certain unknowns in 
function of time. ' According to this definition timeless, statical 

1 Ragnar Frisch, 41 Propagation Problems and Impulse Problems in Dynamic Eoc- 
nomics,” Economic Essays in Honor of Gustav Cassel (London, 1933), pp. 171-205. 
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systems are simply degenerate special cases in which the functional 
equations take on simple forms and determine as solutions func- 
tions of time which are identically constants. We may, however, 
define a dynamical system more narrowly so that it will not be 
regarded as truly dynamic if the functional equations involve only 
variables “of the same instant of time,” containing time, if at all, 
only as a parameter. 2 This excludes the customary statical sys- 
tems of the “historical” as well as the timeless variety. 3 It is 
possible, however, that certain subsets of the solutions of the dy- 
namical equations are defined by equations which are structurally 
identical with those which define a statical system. (Thus the sta- 
tionary solution of a time-sequence analysis, say of the multiplier- 
block-diagram variety, may be determined by a formula exactly 
like that of a timeless, instantaneous system.) This constitutes a 
second possible mutual orientation of statical and dynamical 
systems. 

From still a third point of view a statical system can be looked 
upon as the limiting case of a heavily damped dynamical system. 
Thus any statical equation 

f(x) = 0 

which admits a unique solution x° can be related to a dynamical 
system of the form 

f{x t + &(x t - x 0 )} = 0. 

This yields directly the equivalent linear difference equation 

x t + A(x t — x°) = x°, 

or 

X t+ 1 = X 

Thus, whatever the initial magnitude of x, at the next “instant” 
the system always takes on its correct statical value. This general- 
izes easily to systems of more than one variable. 

In the following I deal with interrelationships between statics 
and dynamics which largely fall under the second of the three 
headings here discussed. 

2 Ragnar Frisch, “On the Notion of Equilibrium and Disequilibrium,” Review of 
Economic Studies , III, February, 1936, 100-105. 

1 In chapter xi I deal at some length with the distinction between complete causal 
systems and historical or incomplete causal ones, also with the closely related topic of 
the generalization of the notion of stationary equilibrium to systems involving time 
explicitly- 
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Functional Equations and Stationary Solutions 

Starting out with n functional equations which constitute a dy- 
namical system involving n unknown functions \xx(t), ■ ■ 
but not time explicitly, 

F^ixiir), x 2 (r), • • x„(r) 5 = 0, (i = 1, •••,«) (1) 

—00 — CO —00 

we define a stationary solution ( x° , • • • , x n °) as one for which 

F { {x i°, x 2 °, • • x n °\ = 0. (2) 

— 00 “00 —00 

These last equations correspond to a set of ordinary statical func- 
tions in n variables (xi, • • • , x n ) : 

f(x u ■ • •, x n ) = 0, (i = 1, • • •, n) (3) 

where of course 

/*'(* i°, • • •. x n°) =0. (i = 1, ■■•,%) (4) 

The types of functional equations which have been most studied 
are those defined by differential equations, difference equations, 
and integral equations, and mixed varieties. The first of these 
possesses the most highly developed theory and provides valuable 
examples of various principles. Since economic observations con- 
sist essentially of series defined for integral values of time, the 
second category of difference equations is perhaps of greatest 
interest to the theoretical economist. 

The above classes of functional equations have this much in 
common: they can all be written as the limit of an infinite set of 
equations in an infinite number of unknowns. However, it is not 
customary to write a system of differential equations in the form 
(1) ; but by use of the singular Dirac 6 function defined to make the 
following identity formally true, 

f(x) - f 8 (a — x)f(a)da, (5) 

%/~co 

any linear differential equation can be written as an integral equa- 
tion. Similarly any integral equation of the form 

B{t) + k(a)B(t - a)da = 0 , 


( 6 ) 



LINEAR AND NONLINEAR STABILITY 


287 

where B is analytic and k possesses finite moments of all orders, 
can be written as a differential equation of infinite order, namely, 

B{t) + £ CiB*(t) = 0, 

0 

where 

(—1)* f kia)a l da 

„ Jo 


Difference equations and mixed types can also be regarded as 
differential equations of infinite order; by the use of the Dirac func- 
tion or by extension of the definition of integration, they can 
equally well be represented as integral equations. In the following 
treatment I shall investigate formal identities, being little con- 
cerned with problems of convergence, and omitting rigorous proofs. 
There is ample precedent and pragmatic justification for this pro- 
cedure in all applied sciences. 

The greatest attention will be paid to systems of differential 
equations and systems of difference equations. Without loss of 
generality these can be written in the following normal form: 



~ = fix 1 , • • •, x n ), (i = 1, ■■ - ,n) 

(9) 

and 


AXi(t) = g^XiOO. • ”,X n (t)U 

(10) 

or 


Xi(t + 1) =-G i {xiit), • • •, *„(/)} = Xi 4- g\ 

(11) 


If not already in this form they can be so transformed by the intro- 
duction of new variables. 

For the stationary solutions, 


-^r = 0 = f(x u • (* = 1, •••,») (12) 

or 

Axi = 0 = g%x 1 , ■ ■ - ,x n ) — G i — Xi. (i = 1 , ■■ - ,n) ( 13 ) 

Systems of Volterra linear integral equations with Poisson 
kernels of the form 

Xi(t ) = ) + E f Kij{t - e)Xj (e)de, (i = 1, • • • , n) (14) 

1 J-OD 


(7) 

(8) 
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are also of interest. Unfortunately, the theory of nonlinear in- 
tegral equations is only fragmentary. 

Linear and Nonlinear Systems 

Up until now most economists have concerned themselves with 
linear systems, not because of any belief that the facts were so 
simple, but rather because of the mathematical difficulties involved 
in nonlinear systems. This is understandable and excusable since 
all thought advances in small steps. Nevertheless, from the stand- 
point of a study of industrial fluctuations this can be a rather seri- 
outs limitation. Thus, in a linear system the amplitude of fluctua- 
tion depends upon the initial displacement ; no intrinsic amplitude 
— as between full employment and zero employment — is involved. 
The attempt to introduce such a fixed amplitude in a linear system 
by the device of determining coefficients so that there will be 
neither dampened nor explosive solutions appears to be misdirected. 
As pointed out in the previous chapter, page 268, the stochastical 
dispersion of the system increases indefinitely. Related to this is 
the fact that this does not yield a unique amplitude, but one which 
depends linearly upon initial conditions.) 

If we insist that a system be linear and that it do not involve 
time explicitly, then for differential and difference systems we are 
restricted to the case of constant coefficients. This type is mathe- 
matically simple, and exact solutions are known. But a high price 
is paid for this simplicity in terms of special assumptions which 
must be made. 

The Nonlinear Differential Equation in one Variable 

Nevertheless, I shall show that the problem of stability of equi- 
librium, if not that of macrodynamic business-cycle analysis, de- 
pends formally in an important way upon the solution of linear 
systems. This may be illustrated by a simple differential equation 
in one variable. 

= X =/(X) = A 0 + AiX + A*X* + •••, (15) 

where/ is analytic and expressible as a power series. This equation 
presents no difficulties of solution since by a single quadrature it 
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may be solved ; namely, 

£w)~ r{X) - <16) 

Let us suppose a “simple” stationary solution X = X° exists 
so that 


sm = 0 , 

f(X°) * 0. 

Then the transformation of variables 

* = X - X° 


(17) 


(IS) 


transforms (15) into the form 


Z f(X°)x< 

x = f(x + X°) = — 77 - 0 + d\X + O 2 X 2 + • • • , 

'll 


(19) 


where a constant term is no longer involved, and a x does not vanish. 

Then we can assert : 

Theorem I : A formal solution of the differential equation (19) can 
be written as an infinite power series in the solution of the simple linear 
system 

x = a\X t (20) 

or 

x(t) = Z Ci{gi(a, 25) } = z (21) 

1 1 


where a is a constant depending upon initial conditions . 4 

That this holds formally (i.e., without regard to the question of 
convergence) can be shown by substitution. Thus, 

X - Z Kigd = X>ji Z Ckgl k }L (22) 

X 1 1 

Expanding out and equating coefficients of the same degree in gi, 
we find that each c can be determined in succession from the a’s 

4 E. Picard, Traits d' Analyse, III, 185; G. D. Rirkhoff, Dynamical Systems (New 
York, 1927), chapter iii. 
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and all previous c’$. 

Ci 

C2 


> 

> 

C n “ ’NL(Cn—\) £l)« 

Equation (15) before transformation to the form not involving 
a constant term can in a similar fashion be formally solved by an 
infinite power series in the solution of the equation 

x ~ uq, (24) 

or 

X = £ Kig a { (t, a) = L + a}*'. (25) 

1 i 

But this is only the conventional power-series solution in t and tells 
us nothing about the stability of the system ; every term taken by 
itself goes to infinity regardless of the a’s, although the infinite 
sum need not. 

It is well known that such a series as that in the exponentials 
is convergent for absolute values of a sufficiently small in a time 
interval sufficiently small. But if the system possesses first-order 
stability, 5 i.e., if a x < 0, then the series will converge for all values 
of t between t 0 and + 00 since | e aiti | is decreasing with time, ap- 
proaching zero in the limit. Since each term approaches this 
limit, then 

lim x(t, a) = 0. (26) 

00 

Therefore , if the system possesses first-order stability, it necessarily 
possesses stability in the small. It is to be emphasized that this is 
an exact solution in which no terms are considered to be of an order 
of smallness and ignorable. Likewise, if the system possesses first- 
order instability, it must be unstable in the small. However, the 
system may be in neutral first-order equilibrium, and possess either 
stability or instability in the small. In this case a\ vanishes, and 
we must consider the stability of the nonvanishing term of lowest 

6 Not to be confused with stability of the first kind, a concept employed in the 
previous chapter. 


1 , 

2azCi 

a x 


( 23 ) 
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degree. This will be done later. We may state our results as 
follows: 

Theorem II: (a) First-order stability is a sufficient condition for 
stability in the small, (b) The absence of first-order instability is a 
necessary condition for stability in the small. 

Example: Logistic Law 

We may illustrate the above principles by considering the 
simplest nonlinear system 

X =* A a + AyX + AfX 1 , (27) 

where 

-4i 2 - 4 AoA t > 0, (28) 

if there are to be any “simple” stationary solutions. ’ 

Without essential loss of generality, we may by linear trans- 
formation of X involving only translation and scale change, and 
by scale change in t, bring the above system into the form 

x = *(1 - x). (29) 

The equation 

x-x t = 0 (30) 

has roots 0 and 1, each of which represents a stationary state. The 
above differential equation will be recognized as one which is satis- 
fied by the Verhulst-Pearl-Reed logistic law, according to which 
percentage changes in a variable fall off linearly with the magnitude 
of that variable, approaching a limit asymptotically. The above 
equation, however, is slightly more general since it admits of solu- 
tions which are not 5-shaped. 

By quadrature its general solution is found to be of the form 

* = 1 + Ke~‘ ’ ( 31 ) 

where K is a parameter determined by the initial value of x at time 
zero according to the formula 

(32) 

For K = 0 we have the stationary solution 

x{t) S3 1, 


(33) 
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and for K = «© we have the stationary solution 


x(t) 0. (34) 

It can be easily verified that the latter stationary level is un- 
stable, while the stationary level of unity is approached asymp- 
totically by all adjacent motions. We may classify all possible 
initial conditions as follows : 


+ is*(0)2». - l < x a o, ) 
o < *(0) a l, osx< + «,| i!2* w 

z(0) =0, K — oo, x = 0; 


1 ; 


(35) 


— oo < x(0) <0, — oo < K < — 1, 


lim x(t) = — oo . 


Let us now apply our expansion theorem to this problem. 6 . 
Expanding around the zero equilibrium point, and determining the 
c coefficients, we readily find 

x — ae‘ — {ae'} 2 + {ae‘} 3 — {ae l \ i -{-■■■= Y. (— l)’ -1 {ae‘S*. (36) 


For given values of t and sufficiently small values of a this is readily 
seen to be a convergent series equal to the geometric series expan- 
sion of 


«e* 

1 + ae l 


1 

1 + - 
a 


(37) 


But for large values of t 

\ae l J > 1, 


(38) 


and the series diverges. This confirms our expectation that the 
zero equilibrium level is unstable. 

The transformation 

y = X — 1 (39) 

enables us to apply our expansion theorem to the determination of 
the stability of the other equilibrium level. Our differential equa- 
tion becomes 

y = - y + y 2 , (40) 

and in the new variable the equilibrium level equals zero. De- 
9 Cf. A. j. Lotka, Elements of Physical Biology (Baltimore, 1925), pp. 64-68. 
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termining the c’s by substitution as before, we find 

y(t) = - {Re-*} + [Ke- 1 ]* - {X<r‘} 8 + {Ke- l V - •• • (41) 

= £ (- 
1 

This will be recognized as the formal expansion of 


y = 


1 + Ke~‘ 


(42) 


For small values of K and nonnegative values of t, this converges 
uniformly, and each term goes to zero as time becomes infinite. 
Thus, the equilibrium is stable in the small. 


1 



This example also throws light on the domain of convergence of 
the series of exponentials. It is easy to see from the exact solution 
of (31) that the stationary level (y — 0, * = 1) is stable for all posi- 
tive displacements from equilibrium, i.e., y{ 0) > 0, x(0) > 1, and 
also for all negative displacements which are less than unity 
in absolute value (— 1 < y(0) < 0, 0 < x(0) < 1). But the 
series expansion of (41) is convergent only for \K\< 1 , or for 
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(— I < y(0) < + =o, i < x(0) < co). 7 Thus, its domain of con- 
vergence is smaller than the region of true stability. It provides 
only a lower limit which is not generally at the same time an upper 
limit to the region of stability. 

In Figure 4 is depicted the solution of this equation for all 
possible initial conditions x(0). The stationary level of unity is 
seen to be stable ; that of zero unstable. The diagram brings out 
one feature not yet mentioned. The lower and upper branches 
each approach an asymptote for finite values of t. This means 
that for negative displacements around the zero level the system 
recedes from equilibrium at an infinite rate of speed after some 
finite time has elapsed. For the upper branch it may be loosely 
said that the system approaches equilibrium after “coming in from 
infinity at an infinite velocity.” 

This example also suggests what is doubtless a valid “separa- 
tion” theorem. Points of stable equilibrium {in the small ) are 
separated by points of definitely unstable equilibrium; and vice versa. 
(In the interpretation of this, stable or unstable equilibrium may 
be of higher order so long as positions of one-sided stability- 
instability are ignored.) 

The Problem of Higher-Order Stability 

Thus far I have considered only “simple” stationary states; i.e., 
those with power-series expressions in which the first-degree term 
does not vanish. I now turn to “degenerate” stationary states, 
those corresponding to multiple roots of the equation 

X = 0 =?f(X), (43) 

where 

= 0. (i-0, •••,»- 1;»£2) (44) 

In the neighborhood of such a root the differential equation takes 
the form 

x — a n x n + a n+ ix n+1 + * • • , (45) 

* The expansion breaks down at the level at which the curve has an inflection point; 
i.e., where 

y(t ) = 0 = f'{y(t)}y. 

I venture the conjecture, completely unverified, that this may be a general phenomenon. 
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where 

n = 2. 

The solution of the equation 

x = a n x n , (46) 

where higher powers are neglected, can be expected by analogy 
with the previous case to dominate the remaining terms for suffi- 
ciently small initial displacements of equilibrium. In the investi- 
gation of simple systems this was analytically deduced (viz., 
Theorems I and II) ; for higher-order systems this conjecture has 
not been verified, but is doubtless true. We wish to investigate, 
therefore, the stability of solutions of equation (46). If they are 
all stable (unstable), we shall say that the equilibrium position in 
question possesses wth-order stability (instability). 

By elementary methods of integration we can find the exact 
solution to (46) ; thus 


x~ n dx = a n dt, 

x 1 ~ n — (1 — n)a n t + ^(O) 1- ”. 


(47) 


To solve explicitly for x two cases arise depending upon whether 
n is odd or even. 


~ {o»(l - n)t + x(0)i~» 

1 

x (^) ~ ^ {o„( 1 — ri)t + x(0) 1 ~ n j 1/(n ~ 1) ’ 


for n even ; 
for n odd ; 


(48) 


where the appropriate sign is to be taken so that x satisfies the 
initial condition x = x(0). Both cases may be subsumed under 
the heading 

” |sgn*(0))- ■ (49) 

Clearly if n is even, in — 1) is odd. Therefore, the second 
term within the brackets takes on either sign depending upon 
whether the initial displacement is plus or minus. Consequently, 
regardless of the sign of a n , for some value of t, and for some dis- 
placement, the denominator will vanish, which means that x be- 
comes infinite, and the equilibrium is not stable. Actually for n 
even the equilibrium possesses one-sided stability-instability. If 


FOUNDATIONS OF ECONOMIC ANALYSIS 


296 

x(0) is of the same sign as o», the motion will be unstable; if x(0) 
is of opposite sign to a n , it will be stable. Thus, we have one-sided 
stability-instability. 

If n is odd, then the second term of the denominator is always 
positive. If, and only if, a n is negative (so that the first term is 
positive) will the denominator stay of the same sign and approach 
infinity as t approaches infinity. Hence, we have the following: 

Theorem III: If the first nonvanishing coefficient is odd and nega- 
tive, the system is stable in the small; if the first nonvanishing coefficient 
is odd and positive, the system is unstable. If the first nonvanishing 
term is even, the system possesses one-sided stability-instability. 

This presents a strong analogy to the necessary secondary con- 
ditions for a maximum. Write the differential equation 

- f{X) = F'(X), (50) 

where 

F(X) = f X f(X)dX. 

Then the following sums up the results achieved : 

Theorem IV: (a) If F(X°) affords a relative maximum to F, 
then X° is a stationary solution of the differential equation and 
possesses stability in the small; and conversely. 

(b) If F(X°) affords a relative minimum to F, then X° is an 
unstable equilibrium level. 

( c ) If X° is a stationary value of F(X°) which is not an extremum, 
then the system possesses one-sided stability-instability. Alterna- 
tively, if F assumes a stationary value, and F' — f assumes an ex- 
tremum value, the equilibrium is stable-unstable. 

(d) If F'(X) vanishes identically, the equilibrium is neutral. 

This possibility of linking up the problem of stability with a 

statical maximum problem is but one special aspect of the Corre- 
spondence Principle, and one to which we shall have occasion to 
refer again. 

An Example of One-Sided Stability-Instability: 
Malthusian and Optimum Population Theories 

The meaning of one-sided stability-instability may not be in- 
tuitively obvious; fortunately a simple well-known economic ex- 
ample may be used to illustrate it. According to Malthus popula- 
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tion would increase, decrease, or remain stationary depending upon 
the per capita level of subsistence (real income, food, etc.)- Let 
X = total population, S — per capita real income. Then the 
percentage rate of growth of the population is an increasing func- 
tion of the level of subsistence, passing from negative values to 
positive values at some “minimum level of subsistence,” S°. 
Mathematically, 

(51) 

where 

ns) > 0 , = 0 . 

But the level of income (production) depends itself upon the 
level of population (labor) as of given capital, land, and technology. 
Moreover, Malthus implicitly and explicitly assumed the law of 
diminishing (per capita) returns. Thus, 

5 = <t>(X), <t>'(X) < 0. (52) 

This last relationship enables us to eliminate S as a variable and to 
express the rate of growth of the population in terms of itself. 

j? IT (S3) 

where 

f = W < 0, 

and a stationary level X° corresponds to 

f(X°) = HS°) = 0. (54) 

The equilibrium is stable because f(X°) < 0. If the popula- 
tion exceeds the equilibrium level, each family will receive less than 
the subsistence level, and the population will decline. If it falls 
below the equilibrium level (through war, etc.), the average income 
will be high, and the population will increase. In Figure 5 the 
minimum level of subsistence is shown by MM'. Below this level 
population will decrease as shown by the long arrow; above this 
level it will increase as shown by the other long arrow. R'R is the 
returns curve, and the intersection E, represents a position of 
stable equilibrium as shown by the short directed arrows. 

More modern theory suggests the possibility that there may be 
increasing returns in the early stages. In this case there may be 
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two intersections between the returns curve and the minimum sub- 
sistence level, yielding two points of equilibrium. In the diagram 
the new returns curve is R"R, and the new intersection point is E 2 . 
The new equilibrium point is unstable since here 

f = V4>' > 0. (55) 

If population falls below this level, it will become extinct since as 
returns fall off, so does population, etc., etc. 



Figure 5 

About a decade ago the theory of an optimum population 
achieved a certain vogue. According to one form of this theory, 
at some intermediate point average returns would be at a maxi- 
mum. If by education one could raise the minimum standard of 
comfort insisted upon by all families to this maximum level, then 
population would reach this optimum equilibrium level. Without 
entering upon the merits or demerits of the scheme, I should like 
to point out that such an equilibrium level possesses one-sided 
stability-instability. For displacements from equilibrium towards 
a larger population it is stable, since such a movement lowers re- 
turns and causes population to decrease towards equilibrium. But 
for negative displacements of population it is unstable, since these 
also lower returns and cause population to decrease still more until 
the point of extinction is reached. In Figure 5 the minimum level 
of subsistence (comfort) is irreversibly pushed up by education 
exactly to NN' so that its intersection, £3, with R"R represents a 
maximum of per capita real income. The arrows indicate the 
one-sided nature of the equilibrium point. 
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Analytically, 

f = 0, (56) 

and the first nonvanishing derivative is even and negative. Thus, 
/ is at an extremum, and Theorem IV, (c), applies. The equi- 
librium is one-sided. 8 


Systems of Equations in “ n " Variables 
Definition : The system of n differential equations 

X% “ f(Xi, * , %n) ~~ X) dijXj “f- ^ijkXjXk “1“ * t 

3 j,k 

(i = 1, • ■ •, ») (57) 

where all summations range from 1 to n, will be said to possess a 
simple stationary solution (0, • • • , 0) providing that the matrix 
(an) possesses n (distinct, nonvanishing) roots (Xi, • • • , X„) , which 
are not connected by a linear commensurability relation of the form 

toi + to* + • • • + inK = 0 (58) 

for any set of integers it , • • • , i „, not all zero. 

In this case a fundamental set of solutions of the equations in- 
volving only linear terms 


n 


— 1C G'ijX'j 

(59) 

j—l 


can be written in the form 


aie Mt , • * * , a n e Xni , 

(60) 


where the X’s are the latent roots of the matrix (<%) or roots of the 
secular equation 


»11 — X Oi2 
(Z»l ffl22 — X 


D(X) = 


&nl &n2 


&lit 

n 


= (Xi— X)(X 2 — X) - • • (X„ — X). 



(61) 


8 1 need not warn the reader against the lack of realism of the above theories in 
light of modern demographic trends. For large parts of Western Europe and North 
American where net and gross reproduction rates are low, perhaps no level of real income 
can lead to a stationary population. Moreover, these rates may fall with increasing 
real income; this, however, is presumably an irreversible effect. 
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Then we have the following theorem : 9 

Theorem V: A formal solution of the set of differential equations 
is provided by a power series in the solutions of the first-order equa- 
tions; i.e., 

y% = G i (aie ht , cue™, • • • , a n e ut ) 

= Z Cijiaje™) + Z c ijk a k aje^+^ + • • • . (62) 

3 ■/.* 

By formal substitution this can be verified, remembering that the 
noncommensurability relationship (58) is satisfied. Each set of c’s 
can be determined in terms of previous sets and the known a’s. 

The matrix a may have (pairs of) complex roots, corresponding 
for linear systems to damped or undamped sine-cosine terms. The 
system will be said to have first-order stability if the real parts of 
all roots, real or complex, are all negative, since this will imply 
damped motion (exponential or harmonic) of the linear system. 
An important part of the stability problem is the determination of 
necessary or sufficient conditions that all real parts are negative, 
and this will be dealt with in Mathematical Appendix B. 

Here I should like to mention the generalization of Theorem II. 

Theorem VI : (a) First-order stability is a sufficient condition for 
stability in the small; ( b ) the absence of first-order instability is a 
necessary condition for stability in the small. 

This follows because the series (62) can be shown to converge 
for all values of t and values of a sufficiently small if all the real 
parts are negative. Since it converges for limited values of a and t, 
and since all terms are decreasing in absolute magnitude, it con- 
sequently never ceases to converge. 

A stationary equilibrium position is not “simple” if a com- 
mensurability relationship of the form mentioned holds. Even if 
such a relationship does exist, the above theorem is valid providing 
there are no zero roots. If no roots vanish, linear commensura- 
bility relations introduce into the infinite power series terms of 
the form 

PnW« (,lXl4 "'‘ + (63) 

where p„(t) is a polynomial. If the real parts of all X’s are negative, 
the exponential will dominate, and the solution will still be stable. 

9 Cf. Picard, op. cii.; Birkhoff, op. cit. 
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Zero roots or ones whose real parts are zero, i.e., pure imagi- 
naries, cause greater difficulties since stability in the small becomes 
dependent upon terms of higher degree. 1 am not aware that this 
has been completely analyzed in the mathematical literature except 
for special cases. I shall not, therefore, enter upon the problem 
except to prove a general theorem relating to those many variable 
systems associated with the maximum of some function. 

Before doing so I may summarize briefly the results achieved 
so far as follows: Stability in the small of a nonlinear system of 
differential equations depends except in singular cases upon the 
stability of a linear system. This dependence can be rigorously 
defined and does not involve a dubious neglect of the squares of 
small quantities, etc. 

The Stability of a Stationary Position 

WHICH IS ALSO A MAXIMUM 

If ( Xi° , • • •, X n °) yields a proper relative maximum to a twice 
differentiable function F(X 1 , • • • , X n ), then it is not hard to show 
by the theorem of the mean that 

dF 

~ « Fi{Xt, • • • , X n °) =0, (* = 1, • • • , n) (64) 

and 

OTi, ■■■, X n )(X 1 - Xi°) 

+ Ft{Xu ■ ■ ;X n )(Xt ~ -XV) + • • • < 0 (65) 

for values of X sufficiently close to X°, but distinct from it. 
Suppose we are given a system of differential equations 

,7V . 

= f(Xu • • • , X n ) = Fi(Xi, • • • , X n ). (66) 

Only special differential systems can be so w T ritten. Unfortunately 
space does not permit a discussion of necessary and sufficient condi- 
tions satisfied by such special systems. 

Theorem VII: (Xi°, • • • , X n °) is a stationary solution for the 
above system, and it is stable in the small. 
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Transforming the equilibrium point to the origin (0, • - -,0), 
we have 

§ = FiiXf + *i, • • • , XS + *„), (67) 


n 

and 53 FiXi < 0 for sufficiently small nonzero values of x. 

i 

Multiplying the first equation of (67) by x lt the second by x 2 , 
etc., and summing we find 




= 23 FiX-i < 0. 


( 68 ) 


For sufficiently small values of x, the sum of squares is decreasing. 
Hence, as t goes to infinity, it approaches a limit which cannot be 
different from zero. If 

lim (xi 2 + x 2 2 + • • • + x„ 2 ) = 0, (69) 

t—* 00 

then 

lim Xi = 0, lim x 2 = 0, • • • , lim x„ = 0. (70) 

+oo i — >oo t-+ oo 

Hence, the equilibrium is stable. A relative proper minimum 
yields definitely unstable equilibrium, while a nonextremum sta- 
tionary value yields stability-instability. 

This theorem, while not applicable to all differential equations, 
is nevertheless very important for economic systems. Within its 
scope it is exceedingly general since it does not require that the f’s 
be analytic, and it covers simultaneously the stability of first and 
higher orders. Economically it says that the system is always 
climbing a hill and will approach the top. 


The Difference Equation in one Variable 

The problem of differential systems has been analyzed in a fairly 
complete manner, and we must now turn to systems of difference 
equations, which are perhaps of even greater importance for eco- 
nomic theory. The simplest case is provided by the general non- 
linear difference equation in one variable 

X(t + 1) = f{X(t)}, 


( 71 ) 
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or 

A X(t) = g{X(t)\ = f - X(t). (72) 

So simple is this equation that we can indicate its solution 
graphically, showing on one diagram all possible types of qualita- 
tive behavior in the neighborhood of an equilibrium position. In 
Figure 6 two functions are plotted — one relating the succeeding 



value of X (vertical axis) to the given value of X (horizontal axis) ; 
the other is simply a 45° line. 

The solution of our equation for any initial condition is shown 
by broken lines running between these two functions in the indi- 
cated fashion. Any initial value X ( t ) leads to a new value X(t + 1) 
indicated by running up vertically to the curve f[X(t ) } ; this new 
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value must be transferred to the abscissa in order to derive its 
successor. This is effected by moving horizontally to the 45° line; 
a vertical movement yields the next value, and so the process goes. 
Stationary equilibrium positions are defined by the intersection of 
the / function and the 45° line, or analytically by roots of the 
equation 

f(X) - X - 0. (73) 

The diagram illustrates what are essentially all possible types 
of equilibria. Twenty-four points of equilibrium are indicated, 
but only sixteen represent qualitatively different kinds of equi- 
librium, eight being duplicates. Point A represents a position of 
equilibrium, stable in the small. Displacement in either direction 
results in an asymptotic return to equilibrium. Point B repre- 
sents unstable equilibrium; displacement is followed by an ever- 
increasing one-way disequilibrating movement. C represents a 
position of stable equilibrium, differing from A in that it has first- 
order neutrality and has only high-order stability. D possesses 
first-order neutrality, but high-order instability; therefore, it is 
unstable in the small, and the recession from equilibrium is mo- 
notonic. E has first-order neutrality, but its next nonvanishing 
derivative is even. Consequently, it has one-sided stability- 
instability. 

Thus far we have met only equilibria like those of the one- 
variable differential system. That the single difference equation is 
richer in types of equilibria is illustrated by the variety still to 
come. F possesses first-order stability, and hence is stable in the 
small. Unlike A the approach to equilibrium is not monotonic, 
but rather by means of damped oscillations of period two. G 
represents simply a position of neutral equilibrium in the small, 
possessing first- and high-order neutrality. The system stays 
wherever it is put. H differs from F only in that the equilibrium 
is unstable, the divergence from equilibrium being along explosive 
oscillations; I is simply a duplication of B, as are K, M, O, Q, S, 
U, and W. 

J represents a position of neutral equilibrium. The system 
oscillates with constant amplitude around this position. (In phys- 
ics this would be called stable equilibrium. In the terminology of 
the previous chapter, it has stability of the second kind — not to be 
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confused with second- or high-order stability.) L like J possesses 
first-order neutrality, but unlike J has high-order stability. Equi- 
librium is approached by damped oscillations. N also has first- 
order neutrality, but possesses high-order instability of an oscilla- 
tory nature. P and R like L and N possess high-order, oscillatory 
stability and instability, respectively, but analytically differ slightly. 
T possesses oscillatory neutrality of all orders, and hence is neutral 
in the small. It can be thought of as including / as a special case. 
Finally, V and X, while they possess first-order stability, are 
analytically of singular type and should be differentiated. The 
former has monotonic stability, the latter oscillatory stability. 

The following classification may clarify the possible types. 


Nonsingular A 


Monotonici 


( First-order^ 

[Singular V 
High-order C 
First-order B 

High-order D 
Stable-unstable E 


[Neutral G 


Oscillatory 


Nonsingular F 


Stable 


First-order) 

[ Singular X 
f Type 1 L 
High-order) 


[Type 2 P 

{ First-order H 

("Type 1 N 
High-order) 

[Type 2 R 


[Neutral J, T 


3°6 
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We discriminate analytically between the sixteen cases depend- 
ing upon the first and higher derivatives of f(X). The first sub- 
division between monotonic and oscillatory hinges upon whether 
f(X) is positive or negative. Within the monotonic classification 
we have first-order stability if f is less than unity, and first-order 
instability if f is greater than unity. If f equals unity, we have 
first-order neutrality and must proceed to higher derivatives. If 
all these derivatives vanish, we have true neutrality in the small. 
If the first nonvanishing derivative is odd and positive, we have 
high-order stability; if odd and negative, we have high-order in- 
stability. If the first nonvanishing higher derivative is even, we 
have one-sided stability-instability. 

Within the oscillatory classification things are even more com- 
plicated. If /' is less than unity in absolute value, we have first- 
order stability; if greater than unity in absolute value, we have 
first-order instability. If /' = — 1, then we must proceed to 
higher derivatives. Should all of these vanish, we have neutral 
oscillatory equilibrium (J). More generally, if all odd higher 
derivatives vanish, the equilibrium is neutral and oscillatory as 
in T. When f' — — 1, and the first nonvanishing derivative is 
odd and positive, the equilibrium possesses high-order oscillatory 
stability; if odd and negative, the equilibrium possesses high-order 
oscillatory instability. When/' = — 1, and the first non vanishing 
derivative is even, then we go on to consider the first nonvanishing 
odd derivative. As in the previous cases, we have stability or 
instability depending upon whether this is negative or positive. 

Finally, we are confronted with the singular cases when f = 0. 
If the next nonvanishing derivative is odd and positive, we have 
monotonic stability; if odd and negative, or if even and of any sign, 
we have stable oscillatory equilibrium. If all derivatives vanish, 
the equilibrium is perfectly stable as indicated in the first section of 
this chapter; when displaced, the system returns “instantly” to equi- 
librium and does not merely approach equilibrium asymptotically. 

A few remarks may be in order concerning the qualitative be- 
havior of a first-order system from any initial condition. It could 
no doubt be shown that it must do one of the following : (a) go off 
to . infinity; (b) approach an equilibrium level; or (c) approach a 
periodic motion of some finite period. If it is reversible, that is, 
if f(X) is not only single-valued but admits of a single-valued in- 
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verse, then the only periodic motion possible under heading (c) is 
one of period two. It is not appropriate to investigate here the 
meaning of the stability of motions more general than stationary 
equilibrium levels. When this is done, it will be found no doubt 
that there exist valid “separation” theorems concerning the order- 
ing of stable and unstable periodic motions. The necessity because 
of continuity (Rolle’s theorem, etc.) of duplicating certain equi- 
librium points in the above diagram provides a hint at such 
relationships. 


Analytic Solution 

As with the differential equation systems an exact solution of 
the general nonlinear analytic difference equation can Jpe indi- 
cated. Let 

x(t + 1) = aix(t) + a^xity + a z x(t) z + • • •, (74) 

or 

A x(t) = (®i — l)x(i) -f aix(t) 2 -f- a z x(ty +'•••. (75) 

We rule out cases where the first-degree term vanishes in either of 
these expressions; and we rule out all cases of singular and neutral 
first-order equilibrium so that ai 5 ^ 1, 0, or — 1 . 

A formal solution of this is given by a power series in the solu- 
tion of the simpler linear equation 

x(t + 1) = a x x(t), 

A x(t) = (a,i — l)x(f). 

Let 

gi(t) = aaf. (77) 

Theorem VIII : A formal solution to (74) is provided by 

x(t ) — Ciiaaf} + c z { affix *} 2 + (78) 

This may be verified by formal substitution provided that Oi 5 ^ 1, 0, 
or — 1. Each c is obtainable in terms of all previous o' s and the 
known a’s. 

More generally, we are given n difference equations in normal 
form 

Xi(t + 1) = E dijXj(t) + E a ijk Xj(t)Xk(t) (79) 
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where the latent roots of a, (Xi, • ■ • , X„), are never equal in absolute 
value to zero or one, and where there exist no linear commensura- 
bility relations of the form 

»i log Xi + m 2 log X 2 + • • • + m n log X B = 0, (80) 

for m's integers not all vanishing. Then 

Theorem IX: A formal solution to the system of difference equa- 
tions is given by an ascending power series in the solutions of the 
linear system 

Xi(t + 1) = £ CLijXft), (81) 

3 

or 

Xi(t) = e^aiXf, • ■ a B X n ‘) 

= £ c iS { a 3 V } + £ c ijk { otj'Kf } { ( ) 

3 

Each set of c’s can be determined by formal substitution from all 
previous sets and the known a’s. If some of the roots are multiple, 
or if a linear commensurability relation of type (80) does hold, 
there will probably exist a similar power-series solution augmented 
by terms with polynomial multipliers in t, provided always that the 
absolute value of all roots equals neither zero nor one. 

All of the remarks of earlier sections concerning convergence of 
such series apply. We must remember, however, the first-order 
stability of a difference-equation system implies 

|Xi| < 1, (83) 

and conversely. The following theorem is easily derived along 
now familiar lines. 

Theorem X : For a difference-equation system first-order stability 
is a sufficient condition for stability m the small, and the absence of 
first-order instability is a necessary condition. 

Space does not permit me to illustrate the above remarks with 
an economic example, such as would be provided by the familiar 
cobweb theorem applied to nonlinear supply and demand curves. 

Other Functional Equations 

While this has not been verified, one can hazard the conjecture 
that more general nonlinear functional equations can in the neigh- 
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borhood of an equilibrium point be expressed as a power series in 
solutions of the simpler linear systems. Thus, under suitable as- 
sumptions functional equations can be written in the Taylor-like 
expansion 

X ; (i) + Fixtir), ■■■,Xn(r)\ 

— CO — 03 

= {Xiit) - X/} + £ f Kij(t - t){*,(t) - X/}dr (84) 

j -CO 

+ 1 f f K ijs (t — r 1 , t — r 2 ) {Xy(rx) — Xj 0 } 

j,s v - 00 

•{X s (r 2 ) -X^Mr^Lrt + •••, 

where K ijt Ki js represent first and second functional derivatives 
respectively. The linear system 

Xt(t) - Xi° + Z f Kii(t - t){Xj(t) - X j ° } dr = 0 (85) 

j */-oo 

is known to have solutions of the form 

Xi(t) - X<° = £ an**, (86) 

1 

where the X’s are infinite roots of the transcendental equation 

D{ X) - jS„ + f K i} -(v)e~^dv | = 0. (87) 

Jo 

In population analysis single integral equations of this type play an 
important part, and the solution in terms of an infinite number of 
exponentials has been called by A. J. Lotka 10 the Hertz-Herglotz 
solution of these equations. It can perhaps be shown that an 
infinite power series in the infinite solutions of the linear system 
will provide a solution for the nonlinear case. 

Similarly mixed equations of the type 

y'it) = f{y(t), y(t - 6 ) } (88) 

may be expressible in terms of the Frisch-Holme linear equation 

y'(t) = ay(t) + by(t -6). (89) 

10 For an excellent bibliography of applications see A. j. Lotka, “A Contribution 
to the Theory of Industrial Replacement,’* Annals of Mathematical Statistics , vol. X 
( 1939 ). 
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As in the preceding integral equation, boundary or initial con- 
ditions for this equation necessarily involve an arbitrary function 
over an interval, and hence the exponential solutions must be in- 
finite in number so that an arbitrary function can be expanded in 
terms of them. (The same would be true of difference equations 
not solely defined for integral values of t. Arbitrary periodic func- 
tions would enter into the solution, and these would be expressible 
in the Fourier infinite exponential series.) This raises problems of 
doubly infinite series and cannot be entered upon here. 



CHAPTER XI 

SOME FUNDAMENTALS OF DYNAMICAL THEORY 


In the previous two chapters I have examined a number of eco- 
nomic examples of historical interest drawn from the fields of theory 
and business cycles, and have attempted to suggest the importance 
of dynamical analysis not only for its own sake, but as an aid in 
arriving at useful theorems in the realm of comparative statics. 
Now with less use of illustrative economic material, I should like 
to examine analytically the formal aspects of this problem, at the 
same time indicating possible directions of generalizations 

Statics and Dynamics 1 

Often in the writings of economists the words “dynamic” and 
* 'static” are used as nothing more than synonyms for good and bad, 
realistic and unrealistic, simple and complex. We damn another 
man's theory by terming it static, and advertise our own by calling 
it dynamic. Examples of this are too plentiful to require citation. 

Some writers attempt to distinguish between statics and dy- 
namics by analogy with what they understand to be the relation- 
ship in theoretical physics. That this is a fruitful and suggestive 
line of approach cannot be doubted. But it is too much to suppose 
that very many economists have the technical knowledge necessary 
to handle the formal properties of analytical dynamics. Conse- 
quently, they become bogged down in the search for economic con- 
cepts corresponding to mass, energy, inertia, momentum, force, 
and space. A case in point is Professor Frank Knight’s otherwise 
stimulating essay on Statics and Dynamics ? 

It is certainly true, notably In the writings of Marshall , 3 that 
economists have made use of biological as well as of mechanical 

1 With a few alterations the following section is taken from my article, “Dynamics, 
Statics, and the Stationary State,” Review of Economic Statistics , XXV (1943), 58-^61 . 

2 Chap, vi of The Ethics of Competition (New York, 1935). This is an English 
translation of an article in Zeitschrift fur Nationalokonomie for 1930. 

3 See the references to statical method and biology in the Index to the eighth edition 
of the Principles. In none of his writings does Marshall show more than a passing 
familiarity, such as might be expected of any intelligent layman, of the biological notions 
of his time. Therefore, he could not be expected to have discerned the lasting truths 

3*1 
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analogies, in which evolution and organic growth is used as the 
antithesis to statical equilibrium analysis. In general the results 
seem to have been disappointing; viz., the haziness involved in 
Marshall’s treatment of decreasing cost. And if one examines the 
more exact biological sciences, one looks in vain for any new 
weapon, secret or otherwise, for discovering scientific truths. If 
the bloodstream is capable of a simple, abstract, rigorous descrip- 
tion in terms of the usual laws of physical thermodynamics, so much 
the better; if not, one must be content with more complicated, un- 
wieldy explanations. Indeed, according to the late L. J. Henderson 
the very notion of a stable equilibrium, so characteristic of physical 
theory, was actually first observed empirically in connection with 
the resistance of the human body to disease, and formulated by 
the ancfents as the well-known vix medicatrix naturae . 4 

Nor should the problems encountered in the biological field be 
considered necessarily more complex and less subject to simple 
formulation than those in the physical field. Few biological 
sciences are less “exact” than meteorology, which must certainly 
be included in the physical sciences . 5 Here, simple and abstract 
theories spun out from a few assumptions are likely to be inferior 
to the intuitive hunches of experienced practitioners, but this is 
only a reflection of the present primitive state of the subject. 
New truths are ascertained in the same way as in more advanced 
subjects, and it is to be hoped that rule of thumb may be replaced 
by more exact and unambiguous formulations . 6 

from the fashions of the moment. Nevertheless, writing at the time that he did, it was 
inevitable that he should have been influenced, if not convinced, by the Spencerian 
doctrines popular at the end of the nineteenth century. 

4 See J. A. Schumpeter, The Theory of Economic Development (Cambridge, Mass., 
1934, English edition), Preface, p. xi, for Mill’s notions of statics and dynamics and their 
intellectual origins. 

5 Of course, it can be said that experimentation is not possible in meteorology as in 
other physical sciences. But what about astronomy, in some ways the most exact of all, 
in which no experimentation is possible? 

6 In discussing the limitations of mathematical methods in economics, Professor 
Viner expresses the belief that the biological character of the subject, so to speak, makes 
such methods of limited applicability. By this I take him to mean that the subject is 
complex and difficult, not that any fundamentally different methods of investigation are 
required. See “Marshall’s Economics, the Man and His Times,” American Economic 
Review, NXXI (1941), 223-236. 

Gustav Cassel in his Fundamental Thoughts in Economics (New York, 1925), 
chap, i, considers Economic Dynamics to be a third stage of analysis, following a pure 
Static Economy and a “quasi-static” Uniformly Progressive Economy. 
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Leaving aside all analogies with other fields, there has neces- 
sarily been within the main corpus of economic theory a preoccupa- 
tion with dynamics, if only implicitly. The Classical economists' 
from Smith through Mill had theories of the long-run movements 
of population and accumulation . 7 J. B. Clark rigidly separated 
the static from the dynamic in his thought.® (Other examples 
could be multiplied.) Clark’s celebrated static state and the 
“circular flow” of Professor Schumpeter raise a rather vexing point 
of terminology — the relation of static to stationary — now cleared 
up more or less to the satisfaction of everybody by Professor Frisch. 

Stationary is a descriptive term characterizing the behavior of ! 
an economic variable over time ; it usually implies constancy, but 
is occasionally generalized to include behavior periodically repeti- 
tive over time. Used in this sense, the motion of a dynamical 
system may be stationary: e.g., the behavior of a pendulum satis- 
fying Newton’s Laws of Motion, but subject to no disturbance and 
hence remaining at rest; or the behavior of national income after 
a change in investment has given rise to dwindling transient geo- 
metric progressions of the usual “block-diagram” character. 

Statical then refers to the form and structure of the postulated 
laws determining the behavior of the system. An equilibrium de- 
fined as the intersection of a pair of curves would be statical. 
Ordinarily, it is “timeless” in that nothing is specified concerning 
the duration of the process, but it may very well be defined as 
holding over time. A simple statical system as defined above 
would also have the property of being stationary ; but, as we shall 
see in a moment, statical systems can be devised which are not 
stationary over time. 

In defining the term dynamical, at least two possibilities suggest 
themselves. First, it may be defined as a general term including 
statical as a special rather degenerate case. Or, on the other hand, 
it may be defined as the totality of all systems which are not statical. 
Much may be said for the first alternative; the second, however, 
brings out some points of controversy in the literature and will be 

7 See L. Robbins, “On a Certain Ambiguity in the Conception of Stationary Equi- 
librium,” Economic Journal , XL (1930), 194-214. 

8 J. M. Clark has wished to carry on where his father left off, to construct a dynamics 

which would supplement statics. See J. M. Clark, A Preface to Social Economics (New 
York, 1936). K 
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discussed here. This decision involves no point of substance, since 
only verbal problems of definition are involved. 

We may say that a system is dynamical if its behavior over time 
is determined by functional equations in which ‘ ‘variables at different 
points of time 1 ' are involved in an “essential' ' way. This formulation 
is to be attributed to Professor Frisch . 9 Special examples of such 
systems are those defined by difference equations, i.e., those in- 
volving a variable and its lagged values ; integral equations in which 
the preceding values of the variable enter in a “continuous” way. 
By a liberal interpretation of the circumlocution “variable at a 
different point of time,” we may bring differential equations under 
the definition, remembering that differential coefficients character- 
ize the behavior of a function in the neighborhood of a point. 
Mixed types and more general functionals are included. 

Att ention is called to the fact that variables at different points 
of time must enter into the problem in an essential way. Thus, 
a system involving a rate of production per unit time, i.e., a time 
derivative, may yet be statical. This is because the variable of 
which the rate is the time derivative may have no economic sig- 
nificance. It can be interpreted as the cumulated amount of pro- 
duction from the beginning of time or from an initial date; no 
significant economic process depends upon this variable. The ne- 
cessity for the present insistence may be apprehended if it is realized 
that every variable can be written as the derivative of something, 
namely its own integral. Moreover, a system may be pseudo- 
dynamic in the sense that formal manipulation of it permits us to 
reduce it to statical form. Unless, therefore, we reserve the desig- 
nation dynamics for systems which involve economically significant 
variables at different points of time in an irremovable way, we shall 
find that no nondynamic systems exist. 

According to the present definition the historical movement of 
a system may not be dynamical. If one year the crop is high be- 
cause of favorable weather, the next year low, and so forth, the 
system will be statical even though not stationary. The same is 
true of a system showing continuous growth or trend, if the secular 

9 Ragnar Frisch, “On the Notion of Equilibrium and Disequilibrium,” Review of 
Economic Studies , III (1935-36), 100-106. 
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movement is taken as a datum and if the system adapts itself 
instantaneously. 10 

On the other hand, a truly dynamical system may be com- 
pletely nonhistorical or causal, in the sense that its behavior de- 
pends only upon its initial conditions and the time which has elapsed, 
the calendar date not entering into the process, T'or many pur- 
poses, it is necessary to work with systems which are both historical 
and dynamical. The imp act of technological change upon the 
economic system is a case in point. Technological change may be 
taken as a historical clatum, to which the economic system reacts 
noninstantaneously or in a dynamic fashiorh _ Another instance 
is provided by a business cycle of a regular periodic character, 
which results from impressing an oscillatory outside force upon a 
mechanism with an intrinsic (damped) period of its own. 

We may distinguish, then, four distinct cases made up of all 
possible combinations of static-dynamic and historical-causal: 

1 . Static and stationary 

2. Static and historical 

3. Dynamic and causal (nonhistorical) 11 

4. Dynamic and historical 

Almost all systems can be placed in one of these categories; and, 
depending upon the point of view or purpose at hand, the analysis 
may be formulated so as to put a given system arbitrarily in one 
category rather than another. Thus, if a system is very heavily 
damped so that it approaches its equilibrium value extremely 
rapidly, its dynamic features may be passed over in order to 
simplify the analysis. 

Or a system which is causal from a very broad viewpoint may 
be regarded as historical if certain movements are taken as unex- 
plained data for purposes of the argument. (In fact, every his- 
torical system is to be regarded as an incomplete causal system.) 

10 1 conceive Henry Moore’s moving equilibrium to be of this statical type, although 
the movements around the secular trend are dynamic in character. H. L. Moore, 
Synthetic Economics (New York, 1929). 

11 The notion of causation in a closed interdependent system is exceedingly slippery 
and ambiguous. As used here, a system is said to be causal if from an initial configura- 
tion it determines its own behavior over time. While it is not appropriate to say that 
one subset of variables causes another to move, it is permissible to speak of a change in 
a given parameter or datum as causing changes in the system or in its behavior over time. 
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To a meteorologist-economist a business cycle caused by weather 
disturbances and sunspots would constitute a causal process. But 
ordinarily the economist is willing to regard the causation as uni- 
lateral and to adopt a division of labor in which he does not study 
astronomy but considers his job as done when he has pushed 
economic analysis to a “noneconomic” cause. 12 However, there 
is nothing sacred about the conventional boundaries of economics ; 
if the cycle were meteorological in origin, economists would branch 
out in that direction, just as in our day a political theory of fiscal 
policy is necessary if one is to understand empirical economic 
phenomena. 

An important class of phenomena cannot conveniently be 
brought under the above four categories. I refer to dynamical 
stochastic processes such as that realized if a damped pendulum is 
subjected to “random” shocks. We shall have reason to discuss 
such processes in connection with dynamical problems arising in 
the study of the business cycle. 

For a given pattern of shocks, as determined by the particular 
workings of chance in the sequence of time under discussion, we 
have simply a dynamic historical system of type 4 above. But if 
we consider the totality of all possible shocks which may be ex- 
pected to occur if they are regarded as random draws from a fixed 
universe, it is clear that calendar time is not really involved, but 
only the time which has elapsed since the beginning of the process. 
In this sense it is like type 3 rather than type 4, although the word 
causal may no longer seem appropriate. 

It seems best, therefore, to specify two more categories: 

5. Stochastical and nonhistorical 

6. Stochastical and historical 

The latter occurs when we have a dynamical system containing 
stochastical variables, and where either the structure of the system 
varies in an essential way with time, or where the universes charac- 
terizing the random variables change in an essential way with time. 
Simple examples of types 5 and 6 are provided by 

ilf t~^~ 1 — — 2 t f v 

Z 4+1 = iX t + t + [m t \, W 


w J. A. Schumpeter, Theory of Economic Development , chap. i. 



FUNDAMENTALS OF DYNAMICAL THEORY 317 

where in the first case the random variable h which may appear at 
any instant of time is drawn from an unchanging universe; and 
where in the second case the random variable m is drawn from a 
universe defined differently at each instant of time. 13 

In his recent book Value and Capital , 14 Professor Hicks has 
given an exceedingly simple definition of dynamics: “I call Eco- 
nomic Statics those parts of economic theory where we do not 
trouble about dating ; Economic Dynamics those parts where every 
quantity must be dated” (p. 115). 

In terms of the above six categories this definition is overly 
general and insufficiently precise. The second category consisting 
of historically moving static equilibria would certainly require 
dating of the variables, but it would not thereby become "dynamic. 
My objection is to his definition, not to his practice, for many of 
the systems which he analyzes are in the strict sense dynam ic. 


Causal Systems 

By a complete causally-determinate system I shall mean one be- 
longing to the third category of the above classification, whose be- 
havior is determined by initial conditions (in the broadest sense) 
in such a way that its behavior depends only upon the time which 
has elapsed since the establishment of such initial conditions. 
That is to say, the specification of similar given initial conditions 
at a later period of time would result in a similar evolution of the 
system except at a constantly later time period. Mathematically, 
if x represents initial conditions at time t°, our solution takes 

13 In the above form the initial formulation of stochastic processes seems to be due 
to G. U. Yule. See the references to the work of Yule, Slutsky, and Frisch in H. Wold, 
A Study in the Analysis of Stationary Time Series (Uppsala, Sweden, 1938). However, 
the notions of a Markoff Chain, a Brownian movement, etc., go far back in the literature 
of mathematics. See S. Chandrasekhar, “Stochastic Problems in Physics and Astron- 
omy,” Reviews of Modern Physics , XV (1943), 1-89. 

As is noted later, many systems which involve stochastical processes can neverthe- 
less be described in part by nonstochastical models of the first four types discussed above. 
Often probabilities and statistical parameters may themselves be treated as causally 
determined coSrdinates. Thus, if it is known that the first system described in equation 
(1) takes on the value k at time 0, and that the mean and standard deviation of h are 
respectively a, &, then the expected (mean value) of Xt is given by the solution of a 
causal dynamical equation like that of (1) with the stochastic term removed. 

14 J. R. Hicks, Value and Capital (Oxford, 1939). 
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X = f[t ~ t°;x(y)2, (2) 

where x and x stand for a finite or even infinite set of variables. 

Without this restriction our system would be a historical one 
depending in an essential way upon the time at which the initial 
conditions are specified, and could only be written 

x = f[t\ t°; 2(£ 0 )]. 15 (3) 

Thus, the differential equation 

x + x = 0 (4) 

does not contain time explicitly, and its solution is of the form 

x{t) = (5) 

while 

x + x = t (6) 

has the solution 

x(t) = (t - 1) + [*(f) + 1 - (7) 

The first is a complete causal system; the second is not. For 
the first a specification of the initial conditions (value of x) and 
a knowledge of the time which has elapsed is sufficient to locate the 
quantity x; in the case of the second it is necessary as well to know 
the historical date at which the initial conditions are preassigned. T 

15 This represents a one parameter, not a two parameter, family of curves. A change 
in t° introduces no new solutions, but simply alters the initial conditions z(t°) which 
identify each. 

16 Complete causal systems are essentially unchanged by the transformation 

i! = t "T cL y 

but become historical ones in the time scale defined by the transformation 

t' - Ht), h"(t) 0; 

although for simple scale changes dimensional constants serve to maintain all essential 
invariances. Conversely, in special cases there may exist transformations (of time) 
which convert a historical system into a causal one; viz., the historical system 

dx , 

,- + * = ° 

becomes after the transformation 

t « where t r « log t, 

a complete causal system in the new time namely, 
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It should perhaps be explicitly mentioned that no metaphysical 
significance attaches to this definition. Not only is an arbitrary 
time scale involved, but also any historical system may be regarded, 
as we shall later see, as an “incomplete” causal one. Furthermore, 
the elements of our system may very well be probabilities, which 
are themselves determinate although the value of certain variates 
may be uncertain. 

Merely from the definition of a causal system certain interest- 
ing properties become obvious. If within a finite period of time a 
system returns to exactly the same initial conditions from which it 
started, its motion must be perfectly periodic, since it must again 
do the same, etc . 17 Then again, a determinate economic system 
cannot on the downswing pass through precisely the sgune con- 
figuration as it did on the upswing. Some of the relevant variables 
(costs, prices, etc.) must present a different pattern or else relevant 
dynamic aspects of them must (time derivatives, lagged values, etc.) . 

Let us now consider some definite causal determinate systems. 
An investigator interested in only certain aspects may wish to 
concentrate upon a particular subset of variables, neglecting their 
mutual interdependence with the totality of all. To a first ap- 
proximation the excluded ones may be assumed to be arbitrarily 
given functions of time. The system studied is essentially an “in- 
complete” causal one and as such is indistinguishable from any 
other historical one. Thus, given three bodies, one of large mass 
compared to the other two and of great distance from them, it is 
often convenient to analyze the behavior of the two smaller ones 
on the assumption that they are in a gravitational field changing 
with respect to time (as the external body pursues its “relatively” 
independent path) ; although in fact the three bodies together form 
a gravitational field independent of time. 

Which variables will be taken as data, and which as unknowns 
to be analyzed, will depend in each case upon the purpose at hand 
and upon a diagnosis of the particular interrelations present. 
Often the economist takes as data certain traditionally noneco- 
nomic variables such as technology, tastes, social and institutional 
conditions, etc. ; although to the students of other disciplines these 
are processes to be explained and analyzed, and are not merely 

17 G. D. Birkhoff and D. C. Lewis, Jr., “Stability in Causal Systems,” Philosophy 
of Science, II (1935), 304-333. 
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history. 18 On the other hand, one may limit the scope of one’s 
inquiry within the economic sphere. Thus, in studying a hypo- 
thetical short inventory cycle one might take longer waves as given. 
Then again, it is often necessary to go beyond the realm of tradi- 
tional economic variables in throwing light upon a particular proc- 
ess; especially often, for example, into the political sphere. The 
world rarely fits into the taxonomic classifications of pedagogues. 

Among the numerous reasons for electing to regard certain 
variables as independent or arbitrary parameters, one in particular 
deserves attention in connection with dynamic analysis. Certain 
processes are conceived to move slowly as compared to others. 
Thus, we distinguish long run tendencies from shorter run tenden- 
cies, and so forth in infinite regression. The interplay of these 
processes will be given in more detail later. 

Stationary States and their Generalization 

In chapter ix a stationary or equilibrium state of a dynamic 
system of n variables defined by n functional equations of the 
general form 

F[xi(r), • • •, x«(r), t\ — 0 (i = 1, • • n) (8) 

—00 —00 

was provided by a set of constants (xf, ■ ■ ■ , x n °) for which 

F%x i°, • • • , x n °, t ) s= 0. (9) 

—00 —00 

Even if the system is not a complete causal one, i.e., if it in- 
volves time explicitly, it may still possess one or more stationary 
equilibrium levels. Thus, the system described in footnote 16, 
p. 318, is historical, but has the stationary equilibrium level x = 0. 
However, only in exceptional cases will a historical system possess 
stationary equilibrium positions ; and as pointed out in chapter ix, 
even complete causal ones may lack such positions. It is desirable, 
therefore, to consider the manner in which a stationary equilibrium 
can be generalized. 

If the variables of our system are “dated,” and involve time 
explicitly, but are not truly dynamic in the Frisch sense of involving 
variables at different points of time (derivatives, integrals, and 

18 See J, A. Schumpeter, The Theory of Economic Development, chap. i. 
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more complicated functionals), the manner of generalization is 
obvious. Our historic (but Frisch-static) system is of the form 1 


F{X, t ) = Ffxu ■ ■ 0 = 0. 19 {i = 1, • • 

-,n) (10) i 

i 

A solution of this system 

„ j 

X = X°it) 

(11) f I 

such that 

1 

0 = 0' 

(12) If 


may be termed the moving equilibrium. However, this is almost 
trivial since the moving equilibrium is the only possible evolution 
of the system . 20 Let us turn, therefore, to truly dynamic historical 
systems. n 

For definiteness, let us contemplate the dynamical system as- 
sumed by H. L. Moore in his pioneering attempt to determine em- 
pirical supply and demand elasticities. While he adopted a general 
equilibrium approach in which the quantity of each good depends 
upon all prices, the formal difficulties of the problem will be lessened 
if we concentrate upon a partial equilibrium market involving only 
one good q and its price p. 

The demand for the good is a functional relationship between 
its quantity, its price, and time, 

Qt = D{p h t). (13) 

Quite inadvertently, and probably for the purely statistical purpose 
of deriving the elasticity of supply simultaneously with the demand 
elasticity, Moore assumed that the quantity supplied depended 
upon time and price at a previous period of time, 

q t — S(p t -i, t). (14) 

The two equations together form a dynamical system which 
determines the evolution of (p, q ) for given initial values of p or q\ 

19 1 shall have frequent recourse to matrix notation whereby a set of variables 
(x h * * • , x„) will be represented by (X), a set of functions *»#«)»* \*J n (x h - * •,#«)] 

by f(X), etc. Thus, f(X) = 0 implies f (x lf •••,*») = <), where (i = 1, •**,»). 

20 Quite possibly one might imagine random or chance errors causing deviations 
from the equilibrium defined by the equations (10) so that the moving equilibrium repre- 
sents a smoothed out trend of the actual observations. One might have f(x -f e, t) = 0, 
where e is a random variable. 
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namely, 

pt = fO,P(t°)J n .. 

it - g\j, mi u J 

There exists a one parameter family of motions; each motion is 
determined by efficacious forces, but not all can be termed moving 
equilibria or the term loses significance. 

If time did not enter explicitly into our equations, i.e., if neither 
the demand curve nor the supply curve were shifting, the stationary 
equilibrium would clearly be defined by 

D{p°) - = 0, 

S(p°) - q° = 0. 

. For this Steady motion 

pt = Pt - 1 = • • • = P°, 

it = qt - 1 = • • • = q°, 

Apt = Agj = 0. 

When time is explicitly involved, what is the analogous position 
of moving equilibrium ? Let us consider the following alternative 
definitions which have been advanced: (1) the definition which 
Moore seemed to urge whereby the position of moving equilibrium 
is to be represented as a statistical trend; (2) the moving equi- 
librium defined by the “equality” of supply and demand; (3) the 
moving equilibrium defined by the “Frisch barring process,” to be 
described later. 

1. The representation of equilibrium as simply a statistically 
fitted trend has apparently no universal validity. In particular, 
if this method is applied to the causal case (especially where the 
approach to equilibrium is nonoscillatory) , it need not lead to the 
correct equilibrium level, nor necessarily to any stationary level. 
Probably its apparent acceptance by Moore rested in part upon its 
supposed relation to the criterion to be described next. For 
stochastical systems, discussed later, trend-fitting is more de- 
fensible, but still not optimal. 

2. The criterion that moving equilibrium be defined by equi- 
librium of supply and demand seems on first sight a natural general- 
ization of the stationary case. But, upon examination, ambiguities 
appear in this formulation. Given any price, demand reacts in- 


(16) 

(17) 
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stantaneously, supply after one year. What sense is there in 
equating the two ? (Of course, each year in the shortest run price 
is determined by the equality of demand and short run supply for 
all possible motions.) Nevertheless, if the shifts are very “slow,” 
one might as a first approximation disregard the differences be- 
tween p t and p t - 1 and equate 

D(p t , t ) = SiPt-u t) = S(p t - Apt, t ) = S(p h t) (18) 

on the assumption that Ap t = 0. Solving 

Dipt, t) - Sip t - 0, 0 = 0, (19) 

we get 

Pt = Pxit), (20) 

and this might be termed the moving equilibrium for price. Para- 
doxically, on the assumption that price is not changing, we derive 
a moving equilibrium path for price ! But this is characteristic of 
the method of successive approximation. Assumptions known to 
be in error are made, later to be corrected. A second approxima- 
tion would be derived by making 

A p t = A (21) 

and equating 

D[_pft), 0 - S[p 2 it) - Apiit), Q = 0, (22) 

and the nth approximation by the relation 

Dlpnit), 0 - Slpnit) - A pn-lit), 0 = 0. (23) 

If the sequence of functions [_piit)~\ converges uniformly to a limit 
function pit), then pit) is a solution of our original system. 

The method outlined above is essentially identical with the 
moving equilibrium of a biological or chemical system undergoing 
slow changes. 21 If the system is defined by 

x = fix, t), (24) 

where ft is small, the moving equilibrium (first approximation), 
Xi it), is defined by 

- = 0 = fix i, t); Xo = constant; (25) 

21 See A. J. Lotka, Elements of Physical Biology (Baltimore, 1925), chap. xxi. This 
contains numerous references. 
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the second approximation, Xi(t), by 

xi = t) ; (26) 

and the nth by 

x n -i = f(x n , t). (27) 

The sequence of functions | >,(£)] may or may not converge uni- 
formly to a limit function; if it does converge, the limit function is 
one solution of the original. (Instead of a single variable any 
number could be involved without affecting the argument.) 

The above definition has the following advantages in its favor. 
For nonhistorical causal systems not involving time explicitly, it 
does yield the correct positions of stationary equilibrium at the 
first approximation. Also the successive approximations are in- 
variant finder changes of variables of the form 

y = f(x ) ; x = 

and changes of time 

t' = g(t) ; t = 

Nevertheless, these advantages do not outweigh the overwhelm- 
ing shortcomings involved in this definition. What after all is 
being approximated ? Suppose all solutions of equation (24) were 
known exactly to be of the form 

x = i p(t, x). (28) 

There would then be no need for approximations at all. Which of 
these solutions could legitimately be termed the moving equilib- 
rium? The particular method of successive approximation out- 
lined above, if it converges at all, selects out arbitrarily and 
mysteriously the particular solution to be dignified by this title. 22 
I reject, therefore, the above method of approximation as defining 
a unique moving equilibrium, although it may be appropriate for 
arriving at particular solutions. 

For a linear differential equation of the form 

x — a{t)x = b(t), ( 29 ) 

the general solution equals any particular solution plus the general 

22 The successive functions in the approximation sequence* do not in general 

satisfy the same initial conditions so that only in the limit can one determine which 
initial conditions the final solution attained will satisfy. 
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solution of the reduced homogeneous equation 

x — a(t)x = 0. (30) 

Let 

x = cu(t ) (31) 

represent the general solution of (30), where c is a parameter 
specified by initial conditions. Let viit) be a particular solution 
of (29). Then 

i>i(f) T cu(f) (32) 

is the general solution of (29). Clearly also 

£ui(/) -j- au{t ) ] T ku(t) (33) 

is a general solution since the expression in brackets is a particular 
solution for a specific a. 

In some fields (electrical network theory, etc.) special signifi- 
cance is attached to that particular solution which “contains” no 
terms of the form [~a«(2)], and it might be thought that this might 
be an acceptable definition of moving equilibrium. Where the 
functions involved are not the simple elementary ones, the above 
criterion does not specify unambiguously a unique function. But 
it can often be modified to do so. In fact, the limit function 
defined by convergent approximation sequences of the type de- 
scribed in the previous section often represents this particular solu- 
tion. I am not aware, however, of any useful purpose to be served 
by embracing this definition. 23 

There is one interesting interpretation of the first approximation 
which deserves detailed attention. One often encounters the no- 
tion of an equilibrium which, before it is reached, recedes because 
of historical perturbing forces. Imagine a boy carrying a stone at 
the end of a piece of string. If he were standing still, equilibrium 
would be attained when it hung vertically at rest. But as he walks 
the stationary equilibrium is never attained, the position in which 

23 In linear systems (electrical, etc.) where there is an impressed periodic force, the 
general solution of the differential equations can be written as the sum of a purely 
periodic motion and a transient. In the case of damped systems the transient necessarily 
goes to zero in the limit, and the motion necessarily approaches the purely periodic func- 
tion. For some purposes it might be convenient to define the purely periodic function 
as a “moving equilibrium” and give it privileged treatment, particularly if the periodic 
function is a pure sine wave. 
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equilibrium might be possible if his motion were to stop being 
constantly displaced. 

We have here what may be termed receding equilibrium. For 
the time i, it is the position of stationary equilibrium which might 
be attained if subsequent to i all historical changes were suspended. 
That is to say, in all our functional equations a bar is placed over 
t wherever it occurs explicitly for times subsequent to i. The result 
is a hypothetical or virtual complete causal system. For a new t, 
we have a corresponding virtual equilibrium, and thus is defined 
our receding equilibrium as a function of time. 

Let us examine a simple example to illuminate this concept. 
Let 

x " t (o4) 

• 

be a historical system. At t equal to i, the receding equilibrium 
is the stationary equilibrium corresponding to the causal system 

§ + x(t) = t, (35) 

where the right-hand member is treated as a constant. Clearly, 
such a position is given by 

0 + x = t. (36) 

But this is precisely the first approximation Xj(^) of equation (25), 
specifically in this case 

^ = 0 = t - (37) 

The virtual path of the system after the freezing of historical 
change is only one possible solution. For other initial conditions 
at time i, different solutions will be defined. In particular, let us 
consider initial conditions u(i, x), where u is a true solution of the 
historical system (34). The corresponding virtual path may be 
written 

x = v[t, u(i, £)]. (38) 

For the specific system defined by (34) 

u(t , x) - t — 1 -f (* + l)e~ ( , 
v\_t, u(i, £)] — ( — 6 l ~h l)s —< "f- t. 


(39) 

(40) 
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The actual solution of the historical system is seen to be an envelope 
to the family of virtual solutions v\j, u(i, x) ]. The virtual move- 
ment for short times approximates closely the true movement in 
that they are mutually tangent. That is, if the boy were suddenly 
to stop walking, the pendulum -would change its motion; but the 
new motion would be tangential to the old, and for short intervals 
they would not diverge much. 

The virtual curves approach asymptotes or hypothetical sta- 
tionary equilibria. These equilibrium values plotted against time 
form the curve of receding equilibria. This function of time is not 
in general a solution of the true dynamical equations. 24 The sta- 
bility of the hypothetical equilibrium can, however, be determined, 
and, as will be seen later, bears some relation to the stability of the 
actual motions. 

3. I turn now briefly to the “normal” values or “equilibrium” 
values defined by Professor Frisch. 25 This is defined by relaxing 
a subset m of our n functional equations 

flxi(r), • • • , xf(r), t] = 0, {i = 1, • • •, n) (41) 

—co —00 

and replacing them by a set of m hypothetical equations 

g l '(xx, • • • , x n , t) =0. (42) 

—00 —CO 

In the new set of n equations the instantaneous forms of the vari- 
ables (at least in some of the places where they occur) are barred 
and regarded as unknowns. Everywhere else (in all dynamic 
forms of the variables and possibly in some places where the in- 
stantaneous forms appear) is substituted one solution of the original 
set of functional equations defined for a particular set of initial con- 
ditions. This yields n historical equations of the special form (10) 

24 It is possible for the receding equilibrium to be one solution as, for example, in 
the system 

y + y =■*. 

y = t is the receding equilibrium and at the same time an actual motion of the system. 
If the boy has always been walking at a steady pace, the pendulum may be hanging 
vertically. 

25 R. Frisch, “On the Notion of Equilibrium and Disequilibrium,” Review of Eco- 
nomic Studies , III (1936), 100-105. 
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in the barred variables, (xu •••,£»). This defines the moving 
equilibrium. 

Professor Frisch is well aware that this definition does not 
correspond in the stationary case to the concept of equilibrium ; he 
even thinks that the former will eventually replace the latter “as 
the tendency to formulate the economic reasoning in exact dynamic 
mathematical terms gains ground.” 26 But this concept, he be- 
lieves, is what many modem writers have in mind. 

Whether or not this last conjecture is true, I do not know, par- 
ticularly since I am unacquainted with the original Scandinavian 
writings. 27 But for the present purpose this is irrelevant; it is 
only important to point out the differences between this concept 
of a “nognal” and the moving equilibrium for which we are looking. 

Some of the salient properties of the “normal” movement are: 

(1) It is in general not a solution of the original set. There 
exists no set of initial conditions which would produce it in actuality. 

(2) It is not unique, but depends upon as many parameters as 
are necessary to specify initial conditions for the original set of 
equations. This is because it depends upon the particular solution 
— out of an infinity of possible solutions — inserted in the modified 
set of equations / and g. (See the italicized sentence above.) 

(3) Being defined by essentially static, albeit possibly historical 
equations, there is no sense in asking questions concerning its 
stability. 

For all these reasons it is unsatisfactory as a representation of 
moving equilibrium for our present purposes. 28 

26 Ibid., p. 102. 

27 Professor Frisch illustrates this concept by referring to the Wicksellian relation 
between the actual and natural rate of interest. In so rationalizing the arguments of 
some neo-Wicksellians he is, I fear, over charitable, ascribing to them an undeserved 
degree of sophistication. Actually, Wicksell himself considered for the most part a 
system of implicit dynamic relations not involving time explicitly, i.e., without historical 
change. (Witness, for example, his controversy with Professor Davidson on the equi- 
librium requirement of stable prices in a system undergoing output expansion because 
of irreversible technological change.) Within a nonhistorical framework it is possible 
to construct dynamic models of the Wicksellian system in which the natural rate of 
interest represents the stationary equilibrium level of the system and not the solution 
of a hypothetical alternative system. If this be granted, what is the corresponding 
condition for a system in which elements of historical change have been introduced? 
This brings us back to our original query. 

28 Still another notion of moving equilibrium is suggested by an explosive exponential 
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Resolution of the Problem 

We have examined in turn alternative definitions of moving 
equilibrium and found good reasons for rejecting each one. This 
suggests a reexamination of the purpose for which it is desirable to 
find a concept of moving equilibrium. 

It will be remembered that in the case of nonhistorical causal 
systems there was found to be no ambiguity in the definition of 
a stationary equilibrium state ; in fact, it was the hope of generaliz- 
ing the notion of a stationary state for historical systems that 
motivated the search for moving equilibria. 

Let us go back a step and ask why we were interested in a 
position of stationary equilibrium. Clearly it is only one out of 
an infinity of possible motions of the dynamical system in question. 
But, and this suggests the answer to our question, if the stationary 
equilibrium is stable, all motions approach it in the limit . 29 

We were concerned then with the behavior of all the motions 
of the dynamical system ; and it was simply in the nature of a co- 

solution of a causal system such as 

y - y = 0 . 

This has solutions of the form 

y — ke K 

On the above definition every motion but the stationary equilibrium one would be a 
moving equilibrium. By definition all the moving equilibria would be unstable. The 
effect of small shocks would be multiplied through time. On the other hand, the variable 
( y/y ) would be relatively stable. 

The above concept which applies, for example, to a population with constant specific 
fertility and mortality is in contrast to the moving equilibrium represented by the 
growth curve of an infant. Given a small shock (measles, etc.), the child's weight will 
deviate from “normal” growth, but afterwards will catch up so that no evidence of the 
interruption will remain in the later progress of the child. Similarly, an economy re- 
covers from the effect of (say) a war and continues on its secular path. But the popula- 
tion referred to above is forever different in absolute numbers after a shock (war, 
etc.) even though its relative age distribution and other features may again approach 
stable form. 

29 Let x°(t) = x° represent the equilibrium position of a system. Let x i {t) represent 
any other motion. Then, if stable, 

lim x i {t) — x ° ; 

i.e., for any given positive e, however small, there exists a t° such that 
| x i {t) — x°| < €, for t > t°. 

But this is symmetrical in x { (t) and x° so that the equilibrium position can be said to 
approach any motion, as well as vice versa. It easily follows that x*(t), any motion, 
approaches x x (£), a particular motion selected arbitrarily. 
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incidence that this problem could be studied by an examination of 
the properties of a single special one. It will perhaps be enlighten- 
ing to point out that even in the nonhistorical case we can just as 
well concentrate upon any other single motion than the equilibrium 
one. For if they all approach the equilibrium motion, then they 
all approach each other. And every motion must approach any 
one selected at random. 

This suggests the following answer to our dilemma. We shall 
concern ourselves not with the stability of a particular motion of a 
historical system which can be given the privileged title of moving 
equilibrium, but rather with the stability of each and every motion of 
the system. This will not preclude our designating for special at- 
tention particular motions with special properties (as, for example, 
strictly periodic motions, steady motion in which some coordinate 
increases linearly, etc.). 

This suggests how I would answer the purely verbal problem : 
what processes shall be designated as equilibrium processes ? Once 
we have stripped the equilibrium concept of normative and teleo- 
logical connotations, it does not much matter how we apply the 
term. The term may be applied to stationary values only. From 
this narrow point of view a competitive industry would be in equi- 
librium only after all long run conditions had been satisfied, entry 
of firms, correct output for each firm, etc. 

For other purposes we might apply the term to those inter- 
mediate short run situations in which each firm is producing at 
price equal to marginal cost (and above average variable cost), 
even if price diverges from average cost so that the number of 
firms is changing or is about to change. It might be reserved for 
situations of the last described type where the further condition is 
realized that the number of firms is changing at a causally deter- 
mined rate. Still others may be willing to apply it to momentary 
conditions of supply and demand including variations in speculative 
stocks of the good in question. 

I, myself, find it convenient to visualize equilibrium processes 
of quite different speed, some very slow compared to others. 
Within each long run there is a shorter run, and within each shorter 
run there is a still shorter run, and so forth in an infinite regression. 
For analytic purposes it is often convenient to treat slow processes 
as data and concentrate upon the processes of interest. For ex- 
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ample, in a short run study of the level of investment, income, and 
employment, it is often convenient to assume that the stock of 
capital is perfectly or sensibly fixed. Of course, the stock of capital 
from a longer run point of view is simpfy the cumulation of net 
investment, and the reciprocal influence between capital and the 
other variables of the system is worthy of study for its own sake, 
both with respect to a hypothetical final equilibrium and the simple 
course of growth of the system over time. 

So to speak, we are able by ceteris paribus assumptions to dis- 
regard the changes in variables subject to motions much “slower” 
than the ones under consideration; this is nothing but the “per- 
turbation” technique of classical mechanics. At the same time we 
are able to abstract from the behavior of processes much “faster” 
than the ones under consideration, either by the assumption that 
they are rapidly damped and can be supposed to have worked out 
their effects, or by inclusion of them in the dynamical equations 
(derivatives, differences, etc.) which determine the behavior of the 
system out of equilibrium. 

The first of the above mentioned alternatives constitutes the 
justification for the use of comparative statics rather than explicit 
dynamics. If one can be sure that the system is stable and strongly 
damped, there is no great harm in neglecting to analyze the exact 
path from one equilibrium to another, and in taking refuge in a 
mutatis mutandis assumption. Of course, if one chooses to neglect 
certain dynamic processes, one may still retain others; e.g., in 
studying capital formation over two decades I may choose to 
neglect inventory fluctuations, but still may retain the acceleration 
principle in its secular aspects. 

Under the second alternative where shorter run processes are 
contained in (say) the differential equations of the system, it is to 
be understood that these differential equations do not necessarily 
hold exactly at each instant of time. There may well be a still 
shorter run theory which explains how still higher differential equa- 
tions lead to (rapidly) damped approaches to the postulated differ- 
ential equation relations. And so forth in endless regression. 

It may be argued that so general a connotation is at variance 
with traditional usage of the word equilibrium. Is it not straining 
language to think of a cannon ball as being at equilibrium, not 
only after it has fallen to the ground at rest, but also at every point 
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in its flight, when it is on its mean trajectory as well as in its pre- 
cession around this path? Perhaps such terminology may occa- 
sionally lead to confusion ; however, with carefully stated qualifica- 
tions it may be convenient. 

To examine the stability of any motion • • • , u n )~\ simply 

substitute into the functional equations 

Xi(t ) = Ui(t) 4- (43) 

where the above equation is simply a definition for the tj’s. There 
result n functional equations in the i?’s since the u’s are given func- 
tions. Because the u’s constitute a solution of the original system, 
there exists a solution in the rj’s of the form (0, 0, • • - , 0). The 

original solution will be stable if for all possible initial conditions 

• 

lim 7ji(t) = 0. {i = 1, • * •, n) (44) 

/—►GO 

Moreover, provided the original functional equations are of a 
very general class (differential equations, difference equations, in- 
tegral equations, etc.), the resulting equations can for sufficiently 
small displacements be regarded as linear in the rj’s. They are not, 
however, independent of time, but ordinarily will involve it 
explicitly. 

This is most easily seen for a single variable. Let our implicit 
functional equation be 

/O(t) + rj(r) ; t] = 0, (45) 

— oo 

where 

flu 0 ) ; 0 = o, (46) 

—co 

because u is a solution. Providing certain assumptions are made 
concerning the continuity of the functional f and its functional 
derivatives of higher order, there exists an expansion very much 
like Taylor’s expansion for ordinary functions of a point. This 
takes the form 

fl u ( T ) + 5 ?( t ); t] = 0 + f Kft, Ti)y]{ji)dr i + • * •, (47) 

— CO J -V, 

where K\ is the functional derivative of /. This seems to result 
only in linear integral equations, but by use of the Stieltjes integral 
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or the Dirac operator 8(x) with the property that 


and 


£ 


f(x)8(x — a)dx — f(a), 


f f(x)8'(x - a)dx = - f(a), 

«£—oo 


(48) 

(49) 


the general linear differential and difference equations can easily 
be written as linear integral equations. Thus, 


takes the form 


where 


dx 

dt 


+ x{t) = 0 


£ 


K(t, r)x(j)dr = 0, 


K(t, r) = — d'(r — t) T- 6(r — t). 


(50) 

(51) 

(52) 


The convertibility of the problem of stability at least in its 
first order aspect into an examination of linear systems is of in- 
estimable value since the largest part of existing mathematical 
knowledge relates to such systems. 


Concepts of Stability 

In chapter ix there wms some discussion of various kinds of 
stability. Our treatment would not be complete without at least 
a cursory survey of the various senses in which this term lias 
been used. 

(a) It has sometimes been used in a very broad sense. Any 
position of equilibrium is stable if deviations from it remain 
bounded. If no motions go out to infinity, then each is stable. 
For many purposes this is unsatisfactory. Better names than 
stability can be suggested for this property. An egg balanced on its 
tip upon a level plain would be stable in the above sense. 

(b) Poincare implied still another definition when he coined 
the phrase stability in the sense of Poisson . 'This means that, 
although the system may not exactly repeat itself starting from an 
arbitrary initial state, yet in general it will return to the vicinity 
of Its Initial state and nearly repeat its motion during a long interval 
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of time.” so This is quite different from the usual meaning of 
stability, and the term recurrent is perhaps more appropriate to 
systems employing this property. 

(c) A very common use of stability in nonconservative physical 
systems is the one which I have termed stability of the first kind. 
It holds when every motion approaches in the limit the position 
of equilibrium (and every other motion). It is not reversible in 
time; going backwards, all stable systems become unstable . 31 

Subdivisions under this heading are of course possible. Thus, 
there is stability of the first kind in the small if in a sufficiently 
small neighborhood of a given motion all motions are stable. First 
order stability of the first kind prevails when certain sufficient condi- 
tions hold ; namely, when the linear terms in the expansion of our 
functional equations taken by themselves yield a system which is 
perfectly stable. We have been concerned almost exclusively with 
this definition of stability. 

(d) All conservative physical systems are reversible in time, and 
volumes are preserved in phase space. This rules out stability of 
the first kind. A conservative system free of the dampening dissi- 
pative force of friction, if displaced from stable equilibrium, never 
returns to rest at stationary equilibrium. A pendulum displaced 
from the perpendicular has higher total energy (equal to potential 
energy plus zero initial kinetic energy) than the equilibrium posi- 
tion since the latter represents a minimum. Total energy must 
be conserved, so, in passing through the equilibrium position, the 
system must possess motion in order to provide some kinetic 
energy to be added to the minimized potential energy. 

(e) There remain still other notions such as permanent stability 
(the previous definition holding for all time between minus and 
plus infinity), semi-permanent stability in which the above proper- 
ties hold for “long” periods of time, complete or trigonometric sta- 
bility in which the motion can be approximated by certain har- 
monic sums, etc . 32 

30 G. D. Birkhoff and D. C. Lewis, Jr., “Stability in Causal Systems,” Philosophy 
of Science , II (1935), 310. 

31 This might be termed stability in the sense of Liapounoff. (See E. Picard, Traite 
D' Analyse, III, p. 200.) It is also shown there that instability of the first order rules 
out stability in this sense. Birkhoff has termed this unilateral stability. 

32 See G. D. Birkhoff, Dynamical Systems (New York, 1927), chap, iv, for more 

detailed discussion. ? 
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It is unnecessary to go further into this problem except to point 
out that unlike the greater part of the subjects treated here, the 
analysis of the concept of stability with the implied investigation 
of the qualitative behavior of generalized dynamical paths leads 
into some of the most difficult problems in higher mathematics. 

Nature of the Business Cycle 

We have seen that there are many interesting and fruitful as- 
pects of dynamics which have nothing to do with the business cycle 
as such, but which are important for the understanding of processes 
usually classified under the heading of economic theory. However, 
of all branches of dynamics the one which has received the greatest 
attention is that dealing with the fluctuations in employment, in- 
come, and general business activity. Numerous explanations, as- 
sociated with a variety of names, have been put forward. In fact, 
there have been so many different theories that it has been neces- 
sary to devise a number of different systems of classification in 
which to catalogue them (e.g., underconsumption, overinvestment, 
etc.; or frictional, institutional, monetary; exogenous versus endo- 
genous; etc.). I should like to survey these in a brief way in order 
to isolate the analytical differences involved rather than to concen- 
trate on the historical, institutional, and personal variations. 

(1) It is a commonplace today to employ the terms exogenous 
and endogenous in describing cycle theories. [ The former refers to 
theories which find the origin of the cycle in some outside, non- 
economic datum which varies in a quasi-periodic manner, and by 
one way causation engenders a cycle in economic time series. A 
sunspot or meteorological theory is usually offered as the prototype 
of this class.J Inasmuch as the economic fluctuations whose ex- 
planation we are seeking are not themselves strictly periodic, or 
more than quasi-oscillatory, it is not really necessary that the 
exogenous factors be any more regular in their periodicity and 
amplitude. Nor need the exogenous factors be completely inde- 
pendent of reciprocal influence from the economic system ; strictly 
speaking, it is only necessary that lines of causation be largely one 
way in direction, from the outside in. Analytically the extreme 
exogenous theory is analogous to a “forced” periodic motion, where 
the economic system reacts instantly to the external driving im- 
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pulse. In terms of my earlier six categories, the system is his- 
torical but static. 

(2) At the other extreme there is the purely endogenous theory, 
or the so-called self-generating cycle. Various theories emphasiz- 
ing monetary factors, inventories, the acceleration principle, psy- 
chology, etc., fall under this heading. The determinants of the 
system consist of dynamic equations involving different periods of 
time (lags, derivatives, etc.) which generate recurrent motions. 
Once the cycle is under way, boom gives way to depression, depres- 
sion to revival, revival to boom, and so forth. 

Some economists think that illegitimate circular reasoning is 
involved in this view: that it shows how the cycle, once started, 
perpetuates itself, but begs the question as to the original existence 
of the Cycle. Even if valid, this would be an objection of no par- 
ticular consequence since there are an infinity of exogenous and 
chance factors which could give rise to the initial cycle. Actually, 
as will be shown below, a cycle is not required to start off the 
process, but only an initial displacement from equilibrium, however 
small and noncyclic in character. 

Endogenous Models 

Analytically the pure endogenous cycle is usually likened to 
the motion of a frictionless pendulum which satisfies a simple 
Newtonian second order differential equation. Upon closer ex- 
amination difficulties appear with this notion. In the first place, 
all dampening must be ruled out or else the cycle will come to an 
end; similarly, in most theories anti-dampening or explosive be- 
havior is ruled out. Now in a physical system there are grand 
“conservation” laws of nature which guarantee that the system 
must fall on the thin line between dampening and anti-dampening, 
between stability and instability. But there is nothing in the 
economic world corresponding to these laws, and so it would seem 
infinitely improbable that the coefficients and structural relations 
of the system be just such as to lead to zero dampening. 

There is still another difficulty. Even within an endogenous 
theory the economic system is not regarded as isolated. It is sub- 
ject to outside disturbances, but these are not taken to be related 
to the business cycle in an important way. If we consider a system 
which taken by itself is on the border line between stability and 
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Instability, it may be said to be stable in the second sense discussed 
earlier. But if we now let random variations impinge on the sys- 
tem, it becomes unstable in the sense that the (expected) amplitude 
of its swings increases with time, and its variance approaches 
Infinity in the limit . 33 

I think it ill-advised, therefore, that Kalecki 34 in an empirical 
determination of the coefficients of an assumed mixed difference- 
differential equation system should have imposed the conditions 
that the motion have zero damping. One can sympathize with 
his aim of deriving a system with constant amplitude of fluctuation, 
whatever one thinks of the empirical foundation for this hypothesis. 
But after Imposing a highly improbable constraint, he did not 
achieve what he was looking for; namely, a system with constant 
amplitude of fluctuation. 

Equally important, the purely endogenous analyses so far dis- 
cussed are unable to provide an explanation for the exact or ap- 
proximate amplitude of the cycle. Like the pendulum system they 
are essentially linear in character, and every linear system can take 
on any amplitude depending only upon the magnitude and sign of 
the original displacements. Thus, a pendulum can be made to 
swing in a small or large arc by means of a small or large Initial 
displacement. To explain the observed level of oscillation of the 
cycle the economist with a linear endogenous theory must go back 
to the value of the original prehistoric disturbance, and must ex- 
plain why succeeding shocks have not swelled the cycle. 

There are two avenues of escape from the fundamental diffi- 
culties encountered in the simple, linear, purely endogenous models. 
The first, which has been widely discussed in the technical litera- 
ture, involves dropping the assumption that the system possesses 
zero dampening, and placing reliance upon external shocks to keep 
fluctuations from dying down. This solution involves dropping 
the assumption of a purely endogenous system, but permits the 
retention of linearity assumptions. It Is discussed in detail in 
later sections. It is to be stressed that the exogenous impulses 
which keep the cycle alive need not themselves be even quasi- 
oscillatory in character. 

33 On p. 343 below the sum of the ^4’s will not converge nor will the sum of their 
squares, where there is zero dampening. 

34 M. Kalecki, “A Macrodynamic Theory of Business Cycles,’* Econometrica, III 
(1935), 327-352. 
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The other alternative is to drop the assumption of linearity even 
though this entails considerable mathematical difficulties. For the 
qualitative problem of stability, relations “in the small” were seen 
to have important implications as necessary conditions for stability 
or instability “in the large.” In consequence, it was useful and 
permissible to work with linear relations as a first approximation 
even though that approximation was admittedly not exact. But 
in the realm of quantitative dynamics and economic fluctuations 
this approximation must be dropped if we are to get correct quan- 
titative and qualitative results. 

For linear systems are lacking in the qualitative richness of 
nonlinear systems. The former are either damped or undamped, 
stable or unstable, regardless of the size of initial displacement. 
NonlinSar systems for the first timefintroduce a theory which accounts 
for fluctuations of a particular amplitude , independent of the initial 
displacement. Thus, a nonlinear system may possess an unstable 
stationary level so that when the equilibrium position is displaced 
in the slightest, ever increasing swings are begun. But instead of 
oscillating to infinity, a particular amplitude is finally reached and 
maintained. This periodic motion may be stable in the sense that 
any further disturbance will result in a motion which approaches 
the given periodic motion from above or below. 

As an example of such a nonlinear equation, consider the 
differential equation 

x — ju(l — x 2 )x 4- x = 0. (53) 

While zero is a stationary equilibrium level of the system, an 
evaluation of the coefficients of the above equation for that level 
will show that the equilibrium is unstable. For small departures 
from equilibrium the system behaves like an explosively oscillatory 
linear system, but finally the nonlinearity asserts itself, and the 
motion settles down to a fixed amplitude. Geometrically, in the 
( x , dx/dt) phase space the equilibrium point is given by the origin, 
out from which flow expanding spirals. However, each of these 
approaches a closed curve representing the periodic motion. If we 
begin initially from a point outside the closed curve, we have con- 
tracting spirals which approach the closed curve from without. 
The stability of the periodic motion in the small is verifiable by 
examining the solution of a variational linear differential equation 
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of the second order with periodic coefficients, derived by evaluating 
the coefficients of (53) along the periodic motion in question. This 
equation of variation in turn is seen to be stable because its 
“ characteristic exponents or multipliers’' all have negative real 
parts. 35 The uniqueness of the periodic solution is more difficult 
to establish, but in the case cited it has been done. 

Such nonlinear systems have received a certain amount of 
attention in theoretical mechanics under the headings of ‘ 'self- 
exciting oscillations” or ‘'relaxation oscillations.” 36 Much still 
remains to be done in this field. In the field of economics one 
nonlinear system has received complete treatment. I refer to the 
so-called cobweb theorem in which supply lags one period. In 
general, this leads to a nonlinear difference equation of the first 
order, which is so simple as to permit of a complete graphical 
solution. 

As we have seen in previous chapters, the linear cobweb case 
permits of only three possibilities: for any amplitude the system is 
either damped, anti-damped, or exactly in between. (Actually in 
real life, both in an economic or physical system, a movement to 
Infinity is unthinkable. Therefore, if a linear system is unstable, 
its motion will grow until it no longer remains linear; its structure 
will “give,” etc.) Dropping the assumption of linearity, we find 
that the system may possess special periodic motions other than 
that of stationary equilibrium. Thus, with the equilibrium point 
unstable, there will be a stable “box” approached by all near-by 
motions, the size of the box determining the unique amplitude of 
the cycle. There may, of course, be several such periodic motions, 
each being alternately stable and unstable. In real life the last 
one can in most cases be taken as stable. 37 

35 See G. D. Birkhoff, Dynamical Systems, chap. iii. 

36 Equations of this type are sometimes known as Van der Pol equations after the 
name of the man who has devoted a good deal of attention to them. However, their 
history goes back into the 19th century at least. A partial bibliography can be found 
in T. Von Karmin, “The Engineer Grapples with Nonlinear Problems,” Bulletin of the 
American Mathematical Society , XLVI (1940), 615-683. See also Norman Levinson and 
Oliver K. Smith, “A General Equation for Relaxation Oscillations,” Duke Mathematical 
Journal , IX (1942), 382-403; B. Van der Pol, “Relaxation Oscillations,” Philosophical 
Magazine , II (1926), 978-992. 

37 There is a vast literature on the cobweb theorem. The most complete treatment 
seems to be that of W. Leontief, “Verzogerte Angebotsanpassung imd Partielles Gleich- 
gewicht,” Zeitschrift fur Nationalokonomie, Band V (1934). 
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Elsewhere in economic analysis there are to be found suggestions 
of nonlinear dynamics . 38 However, the formal difficulties of solu- 
tion are so great that very much remains to be done. This is all 
the more important since a careful analysis of various literary 
theories will show that some of the most simple of them depend in 
an essential way upon nonlinear elements. Among these is a wide 
class of what I have termed “billiard table” theories, which deduce 
a turning point of the cycle from such considerations as the fact 
that the system “hits the full employment ceiling and bounces 
back, so to speak.” Another of this type is the Hawtreyan notion 
of the minimum reserve ratios of banks against which the system 
rebounds. Such notions are less successful in explaining the lower 
turning ‘point since there is no (relevant) natural bottom to the 
economic system . 39 

It is not unexpected to find that the simplest empirical notions 
may lead to the most complicated mathematical problems. This 
is a fact to inspire humility in both literary and mathematical 
investigators, but should prove discouraging to neither. 

Mixed Exogenous-Endogenous Theories 

It is not necessary to subscribe exclusively to one of these two 
polar types. Perhaps most economists are eclectic and prefer a 
combination of both. For example, an economist who believed in 
the reality of waves of different length might plausibly regard the 
long Kondratieff wave as being primarily exogenous in character, 
depending as it does upon wars, gold discoveries, and great tech- 
nological upheavals. Its impact upon the system might involve 
endogenous, transient cyclical movements, but their movements 
might take place in a much shorter time period so that compared 

38 See Pk. Le Corbeiller, “Les Systemes Autoentretenus et les Oscillations de Relaxa- 
tion,” Econometric a, I (1933), 328-332. J. Tinbergen, Statistical Testing of Business 
Cycle Theories (Geneva: League of Nations, 1939). 

39 Professor A. H. Hansen, in his Fiscal Policy and Business Cycles (New York: 
Norton, 1941), chap, xiii, seems to regard the level at which the average propensity to 
consume is 100 per cent as such a natural bottom. I think this is putting the matter 
much too strongly, even though I agree that below this level certain upward tendencies 
are brought into play. R. F. Harrod’s The Trade Cycle (Oxford, 1936) places consider- 
able emphasis upon nonlinear factors in connection with his “dynamic determinants.” 
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to the length of the Kondratieff cycle these might be regarded as 
"rapidly damped” and be neglected. 40 

On the other hand, the extremely short fluctuations, to the 
extent that they exist, might be explained almost wholly in endog- 
enous terms with reference to the dynamics of inventories, the 
acceleration principle, speculation, etc. These short cycles would 
perhaps die down were it not for the random and systematic dis- 
turbances received from the longer movements in investment. 

To illustrate the above remarks the reader may construct for 
himself a model in which there is superimposed a regular periodic 
movement of autonomous net investment upon a system in which 
both the multiplier and the acceleration principle are operating. 
The period of the autonomous movement should be longer than 
the intrinsic period of the (damped) responding mechanism. Spe- 
cifically, consider the equation 

Y(t + 2) - a(l + /3) Y(t + 1) + aPY(t) = P( 0 , (54) 

where a is the marginal propensity to consume, j3 is the so-called 
“relation” of the acceleration principle, and P{t) is a periodic move- 
ment, not necessarily a pure sine wave. 41 

If the process has been going along for a long time, it will finally 
approach a periodic motion of income in which autonomous invest- 
ment shows a lead over income, and in which the amplitude of the 
final motion is proportional to that of Pit), the exact factor of pro- 
portionality increasing with the nearness of the intrinsic period of 
the left-hand side to the postulated periodicity of the right-hand 
side, i.e., with the approach to resonance. 

However, if we break in at an arbitrary point of time, there will 
be in addition to this motion a damped transient whose qualitative 
properties depend only on the endogenous response characteristic 

40 On the whole, we should rather expect greater irregularity in amplitude and 
periodicity in a cycle generated by exogenous factors than in one endogenous in character. 
This is in accord with the prevailing view that a diversity of factors is responsible for 
the few recorded “long weaves” of economic history, and with the notion that there is 
much less predictability into the future of even the qualitative features of such a move- 
ment. On all these matters the reader may be referred to the well-known views of 
Schumpeter, Mitchell, Hansen, et al. 

41 Cf. P. A. Samuelson, “Interactions between the Multiplier Analysis and the 
Principle of Acceleration,” Review of Economic Statistics , XXI (1939), 75-78. E. H. 
Bennion has worked out a number of interesting arithmetical models in which the agree- 
ment between theory and the model sequences is complete. 
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of the multiplier-relation system. This can give shorter cycles, 
brought into life so to speak by the shocks incident to the longer 
wave. 42 

Mixed Systems of a Linear Stochastic Type 

Thus far I have been considering mixed exogenous-endogenous 
systems in which the exogenous forces are periodic in nature, or at 
least quasi-periodic. There is, however, a stochastical theory de- 
signed to explain the existence of quasi-periodic cycles by means of 
a damped system responding to random shocks. The latter serve 
to keep alive the fluctuations of the system in spite of dampening. 43 
But in doing so they tend to displace the phase of the given motion 
so that ordinary periodogram analysis will not, in a long series, 
reveal a ’’“significant” period in the vicinity of the intrinsic fre- 
quencies of the responding system. This is intuitively obvious if 
we think of ordinary periodgram analysis as Fourier analysis, and 
the latter as equivalent to fitting by least squares the best single 
harmonic to the time series in question. The ordinate of the usual 
periodogram is equal to that part of the total variance (in absolute 
or percentage terms) of the time series which can be explained by 
the best harmonic of that frequency. Because of the constant dis- 
turbance of phase, in a long series no sine wave will give a good fit. 44 

It remains an open question whether other of the usual methods 
of time series analysis (counting distances between peaks and 
troughs, etc.) will suffice to give back the known periods even in 
artificially constructed model sequences of the type of equation (1) 

42 If we drop the assumption of linearity, then the response of the system will show 
some differences. The amplitude of income will not be simply proportional to the 
amplitude of the function P(t ), nor will the final solution be a simple additive combina- 
tion of periodic and transient components. Qualitatively, however, the result will be 
a final approach to a periodic motion with characteristic shorter waves in the transition 
period. Unless kept alive, these will die out. 

43 H. Wold, A Study in the Analysis of Stationary Time Series (Uppsala, Sweden, 
1938). 

44 Cf. the reference given in footnote 43. See also the brilliant contribution of 
Professor Ragnar Frisch to the Cassel volume, “Propagation Problems and Impulse 
Problems in Dynamic Economics,” Economic Essays in Honor of Gustav Cassel (London, 
1933), pp. 171—205. While ordinary periodogram analysis will not do, the “Generalized 
Harmonic Analysis” of Norbert Wiener is designed precisely for problems of this type. 
See the references to his 1930 Acta Matkematica article in H. T. Davis, The Analysis of 
Economic Time Series (Bloomington, Indiana: Prindpia Press, 1942), and also the refer- 
ence there to the 1935 article of Bartels, and the latter’s suggestive concept of the 
“harmonic dial.” 
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above. It is clear, however, from the work of Slutsky 45 and others 
that the time series generated by such sequences resemble quali- 
tatively the usually encountered economic time series. 46 

By means of the analysis of the previous chapter it is clear that 
(except for terminal adjustments which become negligible in a long 
series) the solution of the general dynamic stochastic system of 
the following type 

L(7) = Y(t) + ai Y(t -!)+•••+ a n Y(t - n) = {*«}, (55) 

where z is a random variable serially uncorrelated in time and 
drawn from an unchanging universe whose first two moments 
(0, a z 2 ) exist, takes the following form 

Y{t) = AoZ(t) + AxZ(t - 1) + • • • + A n Z{t -») + ••'. (56) 

As t grows the number of coefficients in this sequence becomes in- 
finite, but in such a way as to leave their sum and the sum of their 

n 

squares finite and equal respectively to (1/X) k). By the usual 

0 

central limit theories it is easy to show that (except for terminal 
adjustments) the variance of Y(t) is given by 

Vt = (4o 2 + 4i 2 + ••• +Afle* 9 (57) 

1 or in the limit by 

Fco - k<x 2 \ (58) 

where k is finite. The actual frequency distribution of Y(t) tends 
to normality with increasing t , with zero mean and variance equal 
to the last given expressions. This is usually true even if the orig- 
inal frequency distribution of z is not at all normal (i.e., Gaussian). 

The interested reader can satisfy himself that the superposition 
of a random variable on to a periodic force leads to a forced motion 

45 Eugen. Slutsky, “The Summation of Random Causes as the Source of Cyclic 
Processes,” Econometrica , V (1937), 105-146. 

46 The article by Trygve Haavelmo, “The Probability Approach in Econometrics,” 
Econometrica , vol. XII, Supplement (1944), deals with the problems of determining 
empirically such stochastical relations. The article by H. B. Mann and A. Wald, “On 
the Statistical Treatment of Linear Stochastic Difference Equations,” Econometrica , 
XI (1943), 173-220, shows that the conventional least squares treatment of autocorrela- 
tion is (asymptotically) a “consistent” method of determining the a coefficients. It 
would be outside the scope of the present work to go into these problems. A number 
of articles in the Annals of Mathematical Statistics for 1942 deal with the sampling; 
distribution of the autocorrelation coefficient. 
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just like that described above except that the mean of the asymp- 
totically normal distribution fluctuates according to the periodic 
function described in the previous section. He can also work out 
the implications of having a stochastic variable z t which is not 
serially independent. 

Non-Linear Stochastic Systems 

A more difficult problem is that of developing for a non-linear 
system the stochastic theory which corresponds to that dealt with 
in the previous section. As far as the present writer is aware, this 
is almost completely unexplored ground. No more than a bird’s- 
eye view of the problem can be attempted here. 

We<onsider a non-linear system of the form 

Y(t) - £Y(t - 1 ),•••, F(/ - n), zm = 0, (59) 

whose initial conditions may be written in the abbreviated matrix 
form 

Fo = [F(— i)J (60) 

where i goes from zero to ( n — 1). If desired, the F’s and Z’s may 
be considered as column matrices of many variables. As before, 
the Z refers to a random variable drawn from the same universe 
over time and without serial correlation. Formally, the solution 
may be written in the form 

F(0 = FlZit), Z(t - 1), • • • , Z(0), Fo], (61) 

where the exact nature of F depends upon that of /. It has been 
shown in Mathematical Appendix B, Section 8, that with an ap- 
propriate redefinition of the variables Y and Z we may consider 
equations (59) to be of the first order, i.e., to depend only upon the 
value of Y one period ago. The new Z will now have zeros in some 
of its components, but as before there will be no serial correlation 
between successive Z’s. 

Under certain restrictions upon the partial derivatives of/, such 
as correspond to the economic reality of any relevant damped eco- 
nomic system, it should be possible to enunciate limit theorems not 
unlike those holding for linear systems. The fact that so many 
well-known non-linear statistics go to normality in the limit as the 
sample size becomes large suggests that many of these limit 
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theorems would actually be of Gaussian form. 47 It would be a 
task of some delicacy to establish conditions under which this must 
certainly be so. However, it is not to be thought that in every case 
the limiting distributions must be of Gaussian form. In a moment 
I shall specify a non-linear system which approaches a limiting 
distribution not of Gaussian form; in fact, an example will be given 
of one which remains bounded but approaches no stationary (prob- 
ability) state, but oscillates indefinitely in a simple periodic 
fashion. 

The key to a successful analysis of the difficult non-linear case 
lies in shifting our attack from the study of any one particular 
motion as it is bombarded by a particular set of random shocks to 
the analysis of the probability states corresponding to all possible 
shock patterns weighted in accordance with their likelihood. 
These contrasting approaches are to a certain extent similar to the 
contrast between the motion of a single molecule, the totality of 
motions of an ensemble of molecules as in statistical mechanics, and 
the kinetic theory of gases which describes macroscopic states of a 
system. However, the analogy is not perfectly complete and 
should not be pressed too far. 

First, let us assume that the initial conditions at time t are 
known. Then from our knowledge of the probability universe of 
Z t we may write down immediately the conditional probability 
description of Y t + 1 , given Y t , namely, 

P(Y t+ i, Y t ) = H(Y t+ i, Y t ), ' (62) 

where the exact form of H can be easily specified as soon as / and 
the probability distribution of Z t are given. 

Now if we assume that the exact probability distribution of Y t 
is known, and equal to P t (Y t ), it is possible to write down the 
distribution of Y t + 1 in the form 

P i+ x(F J+ i) = f" H(Y t+1 , Y t )Pt(Y t )dY t . (63) 

It is to be noted that even in the non-linear case the probabilities 
at different instants of time are related by a recursive linear 
functional. 

47 Unlike most statistics computed from a sample, the F operations on the Z’s are 
definitely not symmetric functions. 
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For concreteness we may show what form the II takes in the 
simplest case of a one variable first order system of the form 

yt+ 1 = ay t + Z t (64) 

with probability density for Z given by R(Z). Then corresponding 
to equation (63) we have the relation 

Pf + i(F (+ x) = f R(y t +i - ayt)P t (yi)dy t . (65) 

—oo 

To see whether or not P t approaches a limiting distribution it 
is natural to put the same P in both sides of the above equation 
and solve the resulting equation for the form of the unknown dis- 
tribution. This is an integral equation of the Fredholm type but 
with infinite limits. For the special type of R given above, and 
for a less than unity, the central limit theorem may apply. 

For absolute a greater than unity we know that the variance in- 
creases without limit, and that there cannot be a limiting form 
other than the trivial zero solution. We may state this in another 
manner. If we put a parameter X in front of the integral in (63) 
and ask for a function P which satisfies both sides, such will exist 
only if X is a characteristic value or eigen-value of the kernel H . 
For undamped linear systems X = 1 will not be a characteristic 
value. It would be a task of some considerable mathematical 
difficulty to indicate just when the integral equation arising from 
a non-linear system had an eigen-value equal to unity and to derive 
the corresponding eigen-function Pit). 

The problem would be simplified if we could assume that our 
non-linear / gives bounded values, whatever the value of previous 
Y’ s. This is obviously possible only for non-linear systems ; earlier 
discussion of the physical limitations of full employment and zero 
income suggest that it Is often realistic to assume barriers which 
guarantee this fact. With this assumption the absolute value of 
Y must be less than some number M, and we have —M and +M 
as the limits in the Integrals instead of the infinite limits. This 
removes the singularity in the Fredholm equation. It is to be 
hoped that this continuous but bounded case will be fully analyzed 
by mathematicians and economists. 

I shall confine myself to the simpler case where Y is a discrete 
rather than a continuous variable. By making the classification 


, i. 
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fine enough, we can reach any empirical degree of approximation 
to reality so there is no essential loss of generality involved. By 
replacing the above integrals by Stieltjes integrals, both cases can 
be handled simultaneously. However, in the simpler case under 
discussion, the integrals can be described by sums. But in this 
case Y takes on only integral values, and the probability functions 
for Y and Z are at each time denumerable sequences of numbers. 
Corresponding to the kernel H we now have a matrix H w r hose 
properties depend upon the / function and the probability se- 
quence i?(Z). 

However, to make the problem manageable there are other 
difficulties to be taken care of. Even in the simple linear case, 
with Z taking on integral values and the initial F J s taking on 
integral values, the Y’s at a later stage will not be confined to 
integral values unless the coefficients of the integral equation are 
themselves integers. Let us satisfy this requirement in our simple 
linear example of equation (64) by setting a equal to unity. Our 
H matrix is then seen to consist of an infinite number of rows and 
columns, each column consisting of the R sequence of probabilities 
referring to the different Z values, with 22(0) centered in the 
diagonal of the matrix as follows: 


• -R(0) 

R(- 1) 

R(- 2)--- 


R( 0) 

*(-!)••• 

■■R( 2) 

R( 1) 

R( 0) ••• 


R( 2) 

R( 1) ••• 


It is not an easy matter to work rigorously with infinite matrices 
and their latent roots and vectors. In this case we can be sure 
that there is a latent root of unity by virtue of the property that 
probabilities must add up to unity, but we cannot attach unam- 
biguous meaning to this, nor solve for the corresponding latent 
vector. 

It might be thought that the difficulty is of our own making 
in that the R sequence has been considered to be infinite. How- 
ever, the assumption of a finite number of terms in R only intro- 
duces zeros in each column after one reaches a certain distance 
from the diagonal. The matrix must still be considered to be 
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infinite in size, and, if we wait long enough, F can take on any 
integral value, however large. 

There is the further more serious difficulty that a damped sys- 
tem with leading coefficient equal to unity cannot have coefficients 
which are all integers. Clearly an equation whose roots are all less 
than unity in absolute value will have a product less than unity, 
and hence a coefficient which is not an integer. 

Therefore, even in the discrete case I shall make the non-linear 
assumption that the difference equation which defines the dynamical 
path of the system is such as to yield bounded values of Y t . It is 
convenient to assume that the F’s in addition to being discrete take 
on only a finite number of values in this bounded interval; i.e., we 
“round dff” to some degree of accuracy. Because we make our 
classifications as fine as we wish, there is no serious loss of generality 
involved. In this case the probabilities at time t of the different 
values of F, n in number, can be represented by the equation 

P <+i — HP t> (67) 

where the P’s are column matrices of n elements, and the II is a 
square n by n matrix, whose properties depend upon those of the 
non-linear system and on the probability distribution of Z. It 
will be noted that the sums of the columns of H are in every case 
unity. This follows from the consideration that if F is certainly in 
position i at a given time (so that the P vector has zeros everywhere 
except for a unity in the fth element), then it must certainly be 
somewhere one period later. But its probability of being in each 
of the n positions one period later is under these conditions nothing 
but the ith column, whose sum must therefore add up to unity. 
Note also that all of the elements of H must be positive in virtue 
of their interpretation as conditional probabilities. 

Because the columns add up to unity we place a minus one in 
each diagonal and add each row to the first row. The result is a 
singular matrix with zeros in the first row. This shows that unity 
is a latent root. Similarly the fact that all elements are positive 
and add up to unity guarantees that there are no latent roots which 
exceed unity in absolute value. For if there were, we could select 
initial conditions for which a given element in P would grow ex- 
ponentially without limit. This contradicts the assumption that 
no (probability) element of P can exceed unity. 
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In most cases there will be one root of unity and all other roots 
less than unity in absolute value. Thus, whatever the original 
probability distribution of Y, it will gradually approach a station- 
ary probability state given by the latent vector of H corresponding 
to the unit latent root. It will satisfy the equations 

HP = P, (68) 

where the P is normalized so that its elements (and not their 
squares) add up to unity. The exact values of the P’s can be de- 
rived by solving ( n — 1) linear equations. It will be noted that 
the solution secured will have nothing to do with the limiting 
Gaussian form. 

In special cases the unit latent root may not be simple so that 
no unique stationary probability distribution is reached. Tn still 
other cases there may be another latent root whose absolute value 
is equal to unity, and which is either complex or equal to minus one. 
In either case for “most” initial probability states there will be no 
approach to a stationary probability state; instead there will be 
periodic oscillation. A simple example is provided by a 2 by 2 H 
matrix of the form 

U S! <«» 

If we start out with a probability vector (a, b), it gives rise, because 
of the negative latent root, to the oscillatory sequence ( b , a), ( a , b), 
( b , a), etc., without ever approaching a limit. 



CHAPTER XII 


CONCLUSION 

Economics is a growing subject in which very much is left to be 
done. It is only appropriate, therefore, in bringing this work to a 
close, to indicate a few of the important unsolved problems crying 
out for further investigation. 

In the first and second chapters I outlined the general problem 
of comparative statics: how from a knowledge of the qualitative 
and quantitative properties of our equilibrium conditions we can 
hope to deduce meaningful theorems concerning the direction and 
magnitude of changes in our variables when certain data change. 
In chapter iii it was shown that in a large class of cases the econo- 
mist derives definite theorems by means of the hypothesis that the 
equilibrium position represents a maximum or minimum position. 
The inequalities associated with the definition of an extremum 
position were seen to be the source of fruitful theorems in compara- 
tive statics. 

Chapter iv represented an application of this analysis to the 
cost and production theory of the firm, just as chapter v gave a 
treatment of constrained maxima as required by the theory of con- 
sumer’s behavior. Special aspects of the latter subject received 
treatment in chapters vi and vii. And finally the study of static 
maxima and minima was brought to completion in the analysis of 
welfare economics given in chapter viii. 

In the first chapter of Part II it was shown that simply from 
the standpoint of fruitful comparative statics dynamic analysis is 
useful and necessary. Indeed, the correspondence principle, enun- 
ciating the relationship between the stability conditions of dy- 
namics and the evaluation of displacements in comparative statics, 
provides the second great weapon in the arsenal of the economist 
interested in deriving definite, meaningful theorems. 

In chapter x dynamical systems are studied for their own sake, 
particularly in their stability aspects, while in chapter xi I have 
discussed various fundamentals of dynamical analysis including 
formal problems arising out of the study of business cycles. 
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Broadly speaking, the development of analytical economics has 
proceeded in a natural evolutionary order. First, in Walras we 
have the final culmination of the notion of determinateness of equi- 
librium on the statical level. This has received further elaboration 
in the hands of Pareto and others. 

However, Pareto took a second, further step. He laid the basis 
for a theory of comparative statics by showing how a change in a 
datum would displace the position of equilibrium. Even earlier 
Cournot had pioneered in this “infinitesimal” anafysls, although 
with reference to a narrower set of problems. 

While Pareto laid the basis for comparative statics , his own work 
was not rich in definite theorems in this subject, precisely because 
he rarely concerned himself with the secondary inequalities relevant 
to maximum positions. On the few occasions when he did do so, 
he came to grief because of mathematical errors In their statement. 
It was left for W. E. Johnson, Slutsky, Hicks and Allen, Georgescu- 
Roegen, Hotelling, and other modern writers to begin to make 
progress along this third line. 

However, only a part of economic theory is concerned with the 
maximizing action within an economic unit. Where the Inter- 
actions between individuals are concerned, the scope of fruitful 
comparative statics may be greatly extended by a fourth advance, 
the apprehension of the correspondence principle , whereby the com- 
parative statical behavior of a system is seen to be closely related 
to its dynamical stability properties. 

A natural fifth step to take after we have investigated the 
response of a system to change in given parameters is to Investigate 
its behavior as a result of the passage of time. Thus, we cultivate 
dynamics for its own sake, especially with respect to the qualitative 
properties of the respective motions. 

The usefulness of any theoretical structure lies in the light 
which it throws upon the way economic variables will change when 
there is a change in some datum or parameter. This commonplace 
holds as well in the realm of dynamics as in statics. It is a logical 
next step, therefore, to begin to create a theory of comparative 
dynamics . This will include the theory of comparative statics as 
a special case, and indeed all of the earlier five subjects, but it will 
cover a much richer terrain. 

The central notion of comparative dynamics is simple enough. 
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We change something (just what need not concern us at the mo- 
ment), and we investigate the effect of this change on the whole 
motion or behavior over time of the economic system under in- 
vestigation. It will be seen that comparative statics involves the 
special case where a “permanent” change is made, and only the 
effects upon final levels of stationary equilibrium are in question. 

In comparative dynamics we consider a much broader class of 
changes, (a) We may make a change in initial conditions. By 
definition this alters the immediate behavior of the system in a 
known way. By the assumption of continuity we may infer that 
the position of the system for some region adjacent to the initial 
conditions is also altered in the same direction. For intermediate 
lapses of time a separate investigation is necessary to determine 
what happens to the system. However, for a stable system it is 
clear by virtue of the definition of stability that for sufficiently 
long time periods there will be no final alteration in the behavior 
of the system. 

(b) We may make a change in some force acting on the system. 
Thus, we may cause autonomous investment to vary. Actually, 
there are a variety of cases which must be considered. The change 
in force may be permanent; it may be intermittent; it may be 
transient or instantaneous. In this very last case the analysis may 
be subsumed under the heading of a shift in initial conditions. In 
the case of stable systems the response to a permanent alteration 
gives us a description of the actual path followed by a system in 
going from one “comparative static level” to another. 

For linear dynamic systems, but only for these, the most general 
of the above variations can be thought of as made up of the cumu- 
lated effect of unit impulses, or of changes in the instantaneous 
initial conditions of the system. This follows from the basic 
superposition theorem which underlies much of applied mathe- 
matical analysis. 

(c) Finally, there may be a change in some internal parameter 
of the system. We may ask, for example, what the effect of a 
change in the marginal propensity to consume or in the “relation” 
may have on the behavior of a system. Again, the change in 
question may be permanent, varying, transient, etc. 

The rich variety of forms which the change in data may take 
is matched by the numerous ways in which we can choose to de- 
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scribe the “resulting effects on the behavior of the system.” Ex- 
cept in the most simple cases we shall find it necessary to summarize 
in various ways the information contained in the resulting changes 
in the system at each and every instant of time. 

From a short run point of view interest will center on the im- 
mediate reaction on the system. The answer to this question can 
often be secured by a method which is formally like that of com- 
parative statics with the significant difference that some of the 
variables treated are really dynamic in character. 

This may be illustrated by the important case in the Keynesian 
analysis where investment is taken to be a variable whose value is 
to be determined by a system of relations like those discussed at 
the end of chapter ix. For the usual Keynesian theory, in the 
short run at least, the stock of capital is taken to be constant. 
By “solving” such a system we can finally reduce our relations 
down to a single equation between the amount of investment, I, 
and the stock of capital, K, and the value of some parameter, a. 
This parameter might well be the amount of thriftiness in the 
system. Now if we are interested in the effect upon the stock of 
capital in the immediate future of a change in this parameter, the 
answer can be given by treating investment as if it were an ordinary 
statical variable and solving, as in chapter ii, our equilibrium equa- 
tions to determine the direction of change of investment with 
respect to the parameter, a. 

However, to answer our question in comparative dynamics we 
must introduce the fact that investment, which we have previously 
been treating as an ordinary statical variable, is really the rate of 
change of the stock of capital, or equal to dK/dt. If the compara- 
tive statical analysis tells us that investment is reduced by an in- 
crease in thriftiness, we can be sure that in some sufficiently short 
run the amount of capital in existence will be less than it would 
otherwise have been. For if two curves start out from the same 
point with differing rates of increase, we can be sure that the one 
with the greatest rate of increase will exceed the other for some 
small region at least. 

As it happens, a similar statement can be given in this case to 
the wider problem as to what happens to capital in the long run as 
a result of a change in thriftiness. Thus, if instead of simply asking 
what level of consumption maximizes current investment, we- 
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widen Professor Lange’s question 1 and seek the levels of consump- 
tion leading to the most capital at each instant of time, we shall find 
that capital formation in a run of any length is only maximized if 
at each instant the Lange criteria are met. 

However, when the problems of comparative dynamics are 
posed in general and realistic form, it turns out that the above 
identity results almost by chance. Thus, consider a system of two 
or n dynamical equations 


dxj 

dt 


= g'{x h ■ • •, x n , a). (i = !,•••, n ) 


( 1 ) 


Suppose, furthermore, that an increase in a always increases dxi/dt, 
or that ga 1 is always positive. Does it follow that an increase in a 
always fesults in a higher X\ at each subsequent instant of time? 
The answer is no. For some sufficiently small instant of time, 
starting out from given initial conditions, this must of course be 
true, but it need not continue to be true. In the one variable case 
to which I reduced the Keynes-Lange system, the stronger theorem 
is true. For in this case we may solve explicitly our differential 
equation in the form 


t - t° 


f xi in = o 

g l {u-,a) 


( 2 ) 


and by partial differentiation of the above implicit relation between 
t, Xi, and a, it is easy to show that the change in Xi with respect to a, 
t being fixed, must be of the same sign as the definitely known 
coefficient gf. The reader can work out the geometrical reason 
for this. 

Not only is the direct solution of the system impossible in the 
multi-variable case, but the corresponding theorem is definitely 
untrue. Economically this is not hard to visualize. If the param- 
eter, a, has a pronounced effect upon the growth of a second vari- 
able, after a sufficiently long period of time this indirect influence 
may outweigh the direct favorable influence on the first variable. 
The reader may wish to work out an even more complicated model 
than the above in which a enters as a dynamic parameter. Thus, 
it is not difficult to construct a model in which deficit financing has 

1 0. Lange, “The Rate of Interest and the Optimum Propensity to Consume,” 
Economica, V (1938), 12-32. 
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a favorable effect upon the growth of capital in the short run, but 
the accumulated bad effects of the growing debt are adverse to 
capital growth. I do not wish to pass judgment upon the reality 
of the above assumptions, but simply wish to point out the possible 
occurrence in economic systems of the common medical phenomena 
whereby short term remedies may have long term deleterious 
effects. 

Of course, if we confine our attention to the behavior of long 
period stationary equilibrium positions, the methods of compara- 
tive statics come into their own again. All time derivatives, differ- 
ences, etc., are set equal to zero and the resulting system solved 
like any statical system of chapter ii. 

Finally, the economist will often be interested in the effect upon 
some characteristic of the motion of the system. How^ will a 
change in the “relation” affect the average level of a fluctuating 
trendless system, its periodicity, its dampening, and its amplitude? 
Nor need we confine ourselves to simple stable systems. Thus, it 
is of the greatest interest to know how a change in age-specific 
mortality will affect the net-reproductive-rate; or how the period 
of delay in the expenditure of income will affect the rate of inflation 
under a given defined inflationary gap. 

The further development of analytical economics along the lines 
of comparative dynamics must rest with the future. It is to be 
hoped that it will aid in the attack upon diverse problems — from 
the trivial behavior of a single small commodity, to the fluctuations 
of important components of the business cycle, and even to the 
majestic problems of economic development. 



MATHEMATICAL APPENDIX A 


Many mathematical theorems of importance to the economist 
are not to be found collected in any one convenient place. With- 
out striving for rigor, I have gathered together some of those most 
relevant to the matters under discussion in the present work. 


I. Maximum Conditions for a Single-Variable Function 
Let 

2 = M 

be a defined function with a continuous second derivative 1 every- 
where on an interval 

a ^b. 

z will enjoy a relative maximum at the point f°, providing 

a < t° < b (1) 

for neighboring values of t. 

It is necessary in order that this condition be satisfied that 

f(t°) = 0, (2) 

and 

/"(*°) Si 0. (3) 

The first of these conditions is derived in the following manner. 
By the Theorem of the Mean, 

m - m = (t - + e(t - t°n o<e<i 

Suppose 

m > 0 . 

Then from our continuity assumptions there exists an interval 

1 1 — t° \ < m 

within which the following inequality holds everywhere 

m > o. 

Hence, for all such neighboring values of t larger than f°, we have 

m -m >o, 
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which contradicts our hypothesis. Similarly, 

At*) < 0 

would lead to a contradiction. Therefore, 

f(t°) = 0. 

The second necessary condition is derived in a similar manner. 
From Taylor’s expansion with a remainder we have 

m - m = (.t- tw ) + - ~2 — fu a + - /»)]. 


0 < 6 ' < 1 


But 

4t 

and so this becomes 


f(t°) = 0, 


m - m = (t 2 toy rD° + e\t - m 

Suppose 

> 0. 

Then by assumed continuity 

fit) > 0 


o < e' < l 


1 1 — t° | < m'. 


on an interval 
In consequence, 

m -m> o 

on this interval. But this is a contradiction. Therefore, 

/"(*°) ^ 0. 

Actually, it can be shown under conditions of proper continuity 
that the first non-vanishing derivative must be even and negative 
for a relative maximum. 

I define a regular maximum as one for which the equality sign 
in equation (3) is excluded. It is necessary and sufficient in order 
that z enjoy a regular maximum at t° that 


and 


m = 0, 
rm < o. 


( 4 ) 
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Similarly, necessary conditions for a relative minimum under 
our assumed conditions are 

f(t°) = 0, 

and 

2 0. (5) 

Necessary and sufficient conditions for a regular minimum are 

nn = o, 

/"(*°) > 0. (6) 


and 


Let 


II. Maximum Conditions for a Function of 
Many Variables 


z = f(xu ■■ x n ) 

be a defined function with continuous second order partial deriva- 
tives of all kinds in an open region 5. Then a point (X°) in 5 
affords a relative maximum to z provided that 

f(xi, •••,«„) ^ f(x x °, • • • , x„°) 

for sufficiently close values of (X). 

It is necessary in order for this to be true that 


= 0 
= 0 


df(xi°, • • • 

, xj) 

dxi 


df(x i°, • • ■ 

, X n °) 

dx 2 


<3/(*i°, • • • 

, x„°) 


dx n 


= 0 


and 

dx-tdxz dxidxz dx 2 dx„ 


+ 




dj 

dX n dX n 


h 2 

Un f 
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where (hi, h 2 , • • • , h n ) are arbitrary numbers, and the partial de- 
rivatives are evaluated at the point (X°). 

If we adopt the following notation 


fi° = fiW, 
and 


0 \ • • • , 

’ ” dXi 

f 0 _ 9J(X 1°, • • • , X ra °) 
•'* V dXidXj 


(« = !,•••,») 


then our necessary conditions can be written symbolically 

fi° = 0, (* = !,•••,») (7) 

and 



SiO. 


( 8 ) 


It is necessary and sufficient for a regular relative maximum that 
fi° = 0, O' = 1, • n) 

and 


E E < 0 (9) 

1 x 

for not all h’s equal to zero. In addition to the vanishing of all 
first partial derivatives, a regular relative maximum requires that 
the quadratic form whose coefficients are the second partial deriva- 
tives be negative definite. This last condition is the generalization 
to many variables of the single-variable condition that the second 
derivative be negative for a regular relative simple maximum. Its 
meaning will be amplified in a later section. 

The proof of these necessary conditions can be easily demon- 
strated by the use of a device which enables us to convert our 
problem into that of a single-variable one, upon which we can 
bring to bear the results of the previous section. 

Let 

Xi = Xi° + th it (i — 1, •••,») 

where the h’s take on arbitrary values. Clearly for a relative 
maximum we must have 


f(x i° + thi, xi° + thi, • 


, X n ° + tk n ) S/(X!°, • • • , X n °). 
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For given values of the h’s, not all equal to zero, / can be regarded 
as a function of the single variable i, and the above condition can 
be written 

m s/(o). 


From the previous section we know that this requires that 

m = 0 , 

and 

/"( 0) Si 0. 

But, by performing the indicated differentiation, we find 


and 


jf'(0) = Z fi°hi = 0, , (10) 


/"( 0) = EE Mkfa == 0. 

1 1 

In order that (10) be satisfied for arbitrary values of the h’s, each 
partial derivative must vanish, or 

fi° = 0. (»=!,•••,») 

For a regular relative maximum, we get by this method 
fi° = 0, (i = 1, • • •, ») 

and 

Z i: < 0 , not all h’s = 0, 

1 1 

these conditions being both necessary and sufficient. 

Similarly, a regular relative minimum requires that 

fi° = 0, (i - 1, • * •, n) 

and 

D > 0, not all /i’s = 0. (11) 

1 1 

The appropriate quadratic form must be in this last case positive 
definite. 
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III. Conditions for a Constrained Maximum 
As before, let 

Z = /(x i, • • • , X n ) 

be a defined function with continuous second partial derivatives in 
an open region S. In addition, we impose subsidiary conditions 
on the admissible values of our variables in the form of the following 
m implicit functions of the n variables, defined in S with continuous 


(m < n ) 


Under what conditions will a point ( X° ) in 5 afford a relative 
maximum to z, and satisfy the subsidiary conditions? Such a point 
is defined as follows: 


second partial derivatives, 


G*(Xl, 

Xn) 0, 

G 2 (x i, • • • , 

Xn) — 0, 

G m (x i, •••, 

, Xn) — 0. 


•, x n ) ^/(x 1 °, 


where 


•, x a °), 

(k = 1 , 


G k (x 1 , • • •, Xn) = G k (x 1 °, • • x„°) =0, (k = 1, • • •, m) 

and (X) sufficiently close to (X°). 

The way to solve this problem which would suggest itself most 
naturally would be to use our m subsidiary conditions to eliminate 
m of our variables by expressing them in terms of the remaining 
{n — m) variables. We assume that the rank of the matrix 


d& 

dG 1 

dG 1 

dxi 

dx 2 

3x„ 

d& 


dG 2 

dxi 

3x 2 

3x„ 

dG m 

dG m 

dG m 

dxi 

3x 2 

dx n 


ZGtl 


(k = 1, 

(i = 1. 


is of order m so that at least one m 2 determinant does not vanish. 
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This insures us that we can in fact uniquely solve our subsidiary 
conditions for m variables in terms of the remaining (n — m). 
Since the numbering of our variables is arbitrary, we may write 
this without loss of generality as 

'ft' (.Xm+lj Xm-\-2y ’ , Xn) • (i ~ 1, t TYl) 

By substitution in our original expression, 2 can now be expressed 
in terms of the variables ( x m+ i , x m+2 , • • • , x„) with no conditions of 
constraint, and the analysis of the previous section can be applied. 

However, there is a great loss of symmetry in such a procedure 
since not all our variables are treated alike. Fortunately, by the 
use of an artifice which can be rigorously justified, it is possible to 
derive a more symmetrical set of conditions. 

Consider the Lagrangean-multiplier expression 

H ” JT(Xi, * , X n ) “ fipCli * 1 Xn) 

“b XiU^Xi, * ' , Xn ) "I" * T" XjnC m (Xi, *, Xn ) , 


where the X’s are constants, not all zero, whose values will be de- 
termined later. Obviously for values of ( X ) satisfying the sub- 
sidiary conditions indicated above, all terms after the first must 
vanish, making our new expression equivalent to z. This suggests 
that for a maximum at (X°) 


^L°_ 

dxi ’■ dxi + Al dxi 


+ x. 


dG m ' 

dXi 


subject to 


= 0, 

{i = 1, •••,») (12) 




It is unnecessary for our purposes to indicate explicitly the rigorous 
proof of this statement. 1 

The last two sets together constitute (« + m) equations 
in the (n + m) unknowns (xi, ••• ,*», Xi, • • • , X m ) . Under the 
proper restrictions the values of all the x’s and X’s are uniquely 
determined. 

1 See Carathdodorv, Variaiionsrechnung und Partielle Differ entialgleichungen Erster 
Ordnung (Leipzig und Berlin, 1935), pp. 164-189. The conditions of (12) are identical 
to those of (13) which follow. Carath 6 odory shows that if some minor of (13) greater 
than m were non-vanishing, then one could — by the implicit function theorem — solve 
for some x in terms of /, where / is both greater and less than the extremum value, /°* 
This is a contradiction. 
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The above first necessary conditions can be expressed inde- 
pendently of the X’s. Consider the matrix 


Ik » j// 

' M. 

d(? 

d& 

dG m ' 


dXi 

dxi 

dxi 

dxi 

|i| fg 

M. 

dG 1 

dj? 

dG m 

P fl 


dXi 

dXi 

d%2 

if: 



• 

• 

ill; : 

M. 

dG 1 

Qj 

1 

dG m 

1 

dX n 

dXn 

dX n 

dx n 


L dXi ’ dXi J 


(* = 1, • 
(* = 1, • 


The rank of this matrix must be equal to m; i.e., every (m + l) 2 
determinant must vanish. Thus, 


df_ . dG k 

dXi ’ dXi 


(«' = 1 , • • •, m + 1 ) 
(k ~ 1, • • •, m) 


and so forth for all possible combinations of ( n ) variables taken 
(m + 1) at a time. These will yield (n — m) independent condi- 
tions on the partial derivatives of / and the G’ s. 

For concreteness consider the case of a many variable function 
to be maximized as of one subsidiary condition of restraint. Ac- 
cording to the conditions indicated above there will result at an 
extremum tangency between the locus of restraint and a contour 
locus of z, (n — 1) conditions familiar in many branches of eco- 
nomic theory. 

In addition to the first necessary conditions upon first partial 
derivatives, there are as in the previous sections certain added 
secondary conditions on the second partial derivatives. The fol- 
lowing must hold : 


E E Hq'hfa 

1 1 


f ff-fflL + x, ££L + .. , x w 

I J \ dXidXj ' 1 dXidx, ^ ' m dXid 


G m ° \ 

3 te ,)**' s0 < 14 > 


= 0. (k - 
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In summary, it is necessary and sufficient for a regular relative 
constrained maximum that 

H% = 0 , = 

and 

£ £ Hifhjij < 0 

1 1 

for 

E ^7 hi = 0, not all fe’s = 0. 

The meaning of these secondary conditions requiring the defi- 
niteness of certain quadratic forms under linear restrictions will be 
dealt with later. J 

IV. Quadratic Forms 

A real (homogeneous) quadratic form in n variables (h u ■ ■ ■ , h n ) 
is defined as follows : 

Q{h 1 , • • •, h n ) = an&i 2 + a i2 hiht +•■•+- aiJhK + 

a, 2 ih 2 hi -f- a 22 h2 2 -)-••• -f- -t- 

( 16 ) 

dnlhnhl + dnzhjjli + • ■ • 4* 0, n Jl n 2 , 

where the a’s are real constants. A quadratic form in the single 
variable x would take the form 

Q(x) = ax 2 ; 

in two variables (x, y) it would be written 

ax 2 + 2 bxy + cy 2 . 

Since the order of multiplication is indifferent, it is clear that 
any quadratic form can be made symmetrical, i.e., 

fly — fly, , fl — — a . (17) 

For if not already symmetrical, we may rewrite it without affecting 
its value as 

E t — 

l l * 
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which is symmetrical. Unless otherwise specified we shall deal 
only with symmetrical quadratic forms. 

A positive definite quadratic form is one which is always positive 
except when all the variables equal zero (at which point all quad- 
ratic forms vanish). A negative definite quadratic form is one 
which is always negative except at the origin. A definite form is 
positive definite or negative definite. A semi-definite form is one 
which never changes sign but may be zero at a set of points other 
than the origin. Thus, a positive semi-definite form could more 
appropriately be called a non-negative form. All forms which are 
neither definite nor semi-definite are called indefinite. 

A quadratic form in n variables is singular if the rank of its 
matrix 2 

a — Qzij~i> 

is less than n; i.e., if the determinant 

K/l = 0. 

It may be observed that all semi-definite forms are singular. 

Forms in less than three variables 

In the case of a single variable, it is easy to determine the 
definiteness of 

Q(x ) = ax 2 . 

The form is (1) positive definite, (2) semi-definite and singular, or 
(3) negative definite depending upon whether 

a $ 0. 

It is more difficult to ascertain whether a form in two variables 
is positive definite. If 

Q = ax 2 + 2 bxy 4- cy 2 

is to be always positive except at the origin (0, 0), it must clearly 
be positive for (1, 0). But 

Q{ 1, 0) « a , (18) 

and so a necessary condition is 

a > 0. (19) 

2 A matrix is of rank r if all its determinants of order (r + 1) vanish. All determi- 
nants of a still higher order will be found also to vanish. 
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Similarly, it can be proved that 

c > 0. (20) 

These conditions are necessary but not sufficient. To derive 
still another necessary condition and determine complete sufficient 
conditions, let us consider the form as a quadratic equation in x 
with parameter y, 

f{x) = ax 2 + (2 by)x + (cy 2 ) = 0. 

If the form is positive definite, x must not vanish for any non- 
vanishing y. Solving by the familiar rule for extracting the roots 
of a quadratic, we have 

2 by ± V4 & 2 y 2 — 4 acy 2 |~ — b d= Vi 2 — ac 

X ~ 2 a “ L & 

jj. 

But this must yield no real value of x or the form would not be 
positive definite. Hence, the expression under the radical must be 
negative, and we have as an additional necessary condition 

ac -b 2 > 0. (21) 

However, our three necessary conditions (19), (20), and (21) are 
not all independent. The last of them in conjunction with either 
of the first two implies all three. The sufficiency of these condi- 
tions is implied by the fact that the form is positive for the values 
(1,0), zero only for (0, 0), and never changes sign; consequently, 
it is positive definite. 

These conditions may be given a more convenient form. The 
matrix of the form in two variables is obviously 




If we define a principal minor of a determinant as a subdeterminant 
formed by crossing out the same rows and columns so that the 
diagonal elements of the minor are contained in the diagonal ele- 
ments of the original matrices, then our necessary and sufficient 
conditions are implied in the statement : all principal minors must 
be positive. These are three in number, but not being all inde- 
pendent, a complete set may be specified by making successive 
principal minors positive starting with the upper left-hand comer 
and adding successively corresponding rows and columns. (In this 
case there is only one row and one column to add.) 
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For the form to be negative definite it could be shown along 
similar lines that 


a < 0 , 
a b 
b c 


> 0 ; 


( 22 ) 


or that the first principal minor is negative, and the successive ones 
alternate in sign; those of odd order being negative, and those of 
even order being positive. This is because is negative definite 
only if [_ — is positive definite. 


Forms in any number of variables 

The special methods applicable to forms of few variables cannot 
be applied in the general case. We must therefore seek more 
general methods. A non-singular transformation of variables of 
the type 


hi b ijp j , (i 1 , * • ■ , n) 

i 

or using matrix notation, 

h = bp, p — b~ l h, 

leaves the quadratic form positive definite if originally positive 
definite. This is made plausible from the recognition that the 
origin goes into the origin, and that every point in the h space is 
uniquely related to every point in the p space. Consequently, Q 
after the transformation takes on only the values taken on before. 

Q(h) = Q(bp) = Q(p). 

In particular, let the transformation b be such as to take Q into 
a sum of squares of the form 3 

Q(bp) = d\pi 2 + &2p2 2 + • • • + &npn - ( 23 ) 


It is well known that an infinity of transformations exist which will 
do this. 


3 In matrix notation 


Q — h'ah , 

— p'(b'ab)p, where h = bp. 

b must be chosen so that 

m di o o 

0 <&2 * * * 0 


Vab 


0 0 


tin} 


— MAil where o, 


« - { 


1 for i ~ j 1 
D for i j J 
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Our problem is for the moment reduced to the simpler one of 
determining the definiteness of a quadratic form which is a sum of 
squares. Clearly this is positive definite, if and only if 

dj> 0. (i = l, •*•,») (24) 

By the famous Sylvester's Law of Inertia it is known that the num- 
ber of plus and minus coefficients in the a’s is independent of the 
transformation employed. Clearly, therefore, the a’s, if all posi- 
tive after a particular transformation, are all positive for any 
possible transformation. 

Transformation into a sum of squares 

Two out of the infinity of possible transformations are of par- 
ticular interest since they show definitely what conditions upon 
the matrix a are implied by its being positive definite. First, there 
exists a transformation b — gotten by an extension of the method 
of “completing the square” — such that 


_ 41 

~ A 2 

• (25) 



an 


a in 




a 21 

<2-22 

a%n 




a>nl 

a n 2 * * * 

a n n 


A n 

&11 


0-12 


~ A”- 1 

#21 


#22 * * * 

a 2 

1 



1,2 






an 

&12 


A 

a 2 i 

^22 

A 

ai — 

a 

11 

~ A 

; 

an 

an 

an 

; 

a%i 

a 2 2 

an 

A , 

a z 1 

a$2 

&33 


a 11 &12 

Of 21 $22 
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It is necessary and sufficient, therefore, that these ratios are all 
positive. 4 But 

A 1 = on = <2(1.0, •••,0) > 0. 

Hence, a necessary and sufficient condition that a quadratic form 
be positive definite is that the principal minors (A 1 , ■ ■ ■ , A n ) must 
be positive. 

Minimum properties of Q 

Some preliminary observations are in order before discussing 
explicitly the second transformation. Because a quadratic form 
is homogeneous of the second degree in the h’s, if its sign is always 
positive <5n the unit hypersphere defined by 

hf + h 2 + • * • + hf = 1, (26) 

it is positive everywhere except at the origin. 5 There is no loss 
in generality, therefore, in considering the sign of Q subject to the 
side condition (26). 

Since Q is everywhere continuous with continuous derivatives, 
it necessarily attains a minimum on the closed hypersphere. For 
the form to be positive it is necessary and sufficient that the minimum 
of Q on the unit hypersphere be positive; because then, and only then, 
is it positive everywhere except at the origin. 

This minimum can be calculated explicitly. Consider 

<2 = IS aijhihj 

x 1 

4 Cf. H. Hancock, Theory of Maxima and Minima, pp, 89-91. 

5 This is to be seen as follows. If Q were negative for any value of h(= h) not 
necessarily on the unit hypersphere, it would be negative for some value on the hyper- 
sphere. Suppose 

£ A* = * 

1 

Then the point (£1 /r, • • •, h„/r) is on the unit hypersphere and 

t\ 1 
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subject to 

E hi 1 -1=0. 

i 

Form the expression 

Oijhihj - X(£ W - 1). 

ii i 

According to the previous section it is necessary for a minimum that 

2 £ ai)hj — 2\& t - = 0 , (* = !,••■,») 


£ (<z<y — \&a)hj = 0. (i = 1, • • •, n) (27) 

i 

These are linear homogeneous equations which possess a non- 
vanishing solution if, and only if, the determinant 

D(\) = \a i} - \5 y | = 0.® (28) 

This is an reth degree equation in X and will possess n roots, some 
of which may be repeated. For simplicity, we assume them to be 
distinct. Let us indicate these by 

Xi < < • • * < x„. 

At least one must be real since a real minimum does exist. Ac- 
tually, by a well-known theorem of Hermitian matrices of which 
our symmetrical real matrix is a special case, all are necessarily 
real. 7 Corresponding to any X*, there exists a unique solution 
( hi k , • • • , h n k ) satisfying equations (27) and lying on the unit hyper- 
sphere defined by (26). 

Also, multiplying the first equation of (27) by hi k , the second 
by h 2 k , etc., and adding we find 

X* = £ £ = G(*‘)- (* - 1 ,•••,*) (29) 

i i 

It follows that the smallest of the X’s, Xi, equals the minimum of 

6 This is called a secular equation , a determinantal equation , and the characteristic 
equation of the matrix a. Its roots are the latent roots of the matrix a. 

7 H. Hancock, Theory of Maxima and Minima , pp. 107-109. See also the discussion 
on orthogonality in footnote 9 below. 
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the quadratic form. Hence, a necessary and sufficient condition that 
a quadratic form he positive definite is that all the latent roots of its 
matrix be positive . 8 

Similarly, a form is positive semi-definite if none of the X’s is 
negative, although one or more is zero. A form is negative definite 
if, and only if, all the X’s are negative. 

Second transformation into a sum of squares 

This can be proved in still another way. Consider the trans- 
formation 


pk = If hfhi 


(£ = !,•••,») 


( 30 ) 


It is orthogonal, i.e ., 9 




( 31 ) 


Z hfhi” 1 = hm- 

1 

8 It can be shown that X 2 is the minimum of Q on the set of points lying on the unit 
hypersphere and satisfying the constraint defined by 

hj^hx -f* • • * -J- h,}h n = 0. 

If we adjoin still another constraint, 

Jlffix 4 " hffllz hrfhn as! 0 , 

As minimizes Q. Generally, A P minimizes Q subject to 

i v = 1, 


Z h’h< = 0. O' = 1, — 1) 

1 


* In matrix notation 


b « 

Orthogonality is defined by 

bb' ~ I; therefore, b' = b~K 
Therefore, p = bh implies h — b F p. 

To show that any two different h columns are orthogonal, note that 
ah h = A k h k , ah? - A ,'W, 

and that 

0 = (fr'Yah* - {h k Yah? = (A* - A ,)(#)'#. 

If the two roots are unequal, the very last product must vanish and the h's are orthogonal. 

Because two vectors consisting of conjugate complex numbers cannot be orthogonal, 
this shows the truth of the previous assertion that all roots of a symmetrical matrix 
must be real. 
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Hence, 

hi — £ hi k pk, (* = 1, ••■,») (32) 

i 

and 

<?(£) = E Z E E Upkhrpm 

11 11 

= E E (E Cbijhi k pk)hj n p m 10 
1 1 1 

= E W. . (33) 

1 


This shows that the original quadratic form can be transformed 
into a sum of squares whose coefficients are the latent roots of the 
matrix a. This furnishes a second proof that the Vs must all be 
positive if the quadratic form is positive definite, and conversely . u 


Characteristic equation of a matrix 

Let us consider in more detail the nth degree determinantal 
equation 

D(V) = \o>n — XS ti j = (Ai — X)(X2 — X) • • • (A n — A). (34) 

Expanding out the determinant, this becomes 

(— X)” + Ci(— X) n_1 + • • • + c„_i(— X) + c„ = 0, (35) 

10 Proof that the expression in parentheses above equals X& is derived by multiplying 
the 4th equation of (27) by h m and summing. This gives 

n n n 

OMIT ” Xfc ^ ki k hi m — 'hk&km 
1 1 ' 1 • 

because of the orthogonality property of the transformation b, 

11 The above discussion implicitly assumes that all latent roots are distinct. If they 
are not, the matrix a can still, because of its symmetry and linear divisors, be transformed 
into a diagonal matrix whose elements are the latent roots, but not by a unique orthogonal 
transformation. 
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Ci = On + an -4 b dnn - sum of all n principal minors 

of the first order. 

on oj3 _ sum of all n(n — l)/2 

o 3 i #33 ‘ ' — principal minors of the 

second order. 


Ci — 


an a i2 
&21 &22 


+ 


(36) 


c r - sum of all n\/r\(n — r)l principal minors of the rth 
order. 


Cn = | | = z?(0). 

From the familiar relationship between the roots and coefficients of 
a polynomial we have 

Ci = Xi + X2 + • • • + X„ = an + <222 ~b • • * + a nn 
c 2 — X1X2 + X1X3 + • • • + XiX„ 

+ X2X3 + • • • + X 2 X n 
+ • • • + X„_iX„ 


(37) 


c r — sum of all possible products of (Xi, • ■ • , X„) taken r 
at a time. 


c n - |a«| = D(0) = X 1 X 2 • • ■ X* > 0. 

Descartes’ rule of signs requires that all the c’s be positive if 
all the X’s are to be positive, and conversely. It is clearly sufficient 
that all principal minors be positive. 

It is obvious that definiteness of the quadratic form Q implies 
definiteness of all quadratic forms made up of subsets of the vari- 
ables, for these are simply special cases when some of the h’s are 
set equal to zero. The latent roots of the matrices of all such 
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subsets must be positive. From a consideration of the minimum 
properties of the X’s given in footnote 8, it follows that any such 
latent root X* satisfies the relation 

XiSX*5 X„. (38) 

From this fact we have another proof of the necessity that all 
principal minors must be positive since the product of the latent 
roots of a submatrix equals a corresponding principal minor. 

Definiteness of inverse matrix 

Repeatedly in economic problems it is important to be able to 
evaluate the principal minors of the reciprocal or inverse matrix 
ar 1 when a is known to be definite. 12 The following theorem is, 
therefore, very useful. If a is the matrix of a positive definite form, 
then a -1 is the matrix of a positive definite form. Let 

Q = h'ah. 

Perform the transformation 

h = a~ x p, (39) 

where a — a' because of the symmetry of the matrix a. Then 

Q = p'a~ v aa~ x p = p'a~ x 'Ip — p'ar x p. (40) 

This shows that the original quadratic form can be transformed 
into one whose matrix is the inverse of the original matrix. From 
our previous discussion this must be definite. 13 

12 

ar 1 = 


where A is the determinant of the matrix a f and An is the cofactor of the element in the 
fth row and jth column of the determinant A ~ | an | • ar 1 is termed the inverse of a 
because the transformation ar 1 following the transformation a is equivalent to the 
identity transformation I because of the property 

aa~ x — a~ l a = I. 

Only non-singular matrices have inverses. 

18 Sylvester’s Law of Latency states a more general relationship. The latent roots 
of f(a) are equal to /(X) where the X’s are the latent roots of a. This follows from the 
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V. Quadratic Forms Definite under Constraints 

Let us now derive necessary and sufficient conditions that a 
quadratic form 

n n 

Q “ £ dijhihj 

i 1 

is positive definite for values of h satisfying m(< n) linear condi- 
tions of constraint 

ilgfhj = 0, (a = 1, •••, w) (41) 

i 

where Jg/I is of rank m . 14 It would be possible of course to 
eliminate m variables by use of the equations of constraint, sub- 
stitute in Q, and apply the theory of the previous section to a 
quadratic form in ( n — m) variables. But this is lacking in 
symmetry. 

following identities: 

ax = \Ix 
f(a)x - f(\I)x 

= mix. 

Where /(a) = a -1 , the theorem above emerges as a special case. In the above theorem 
/ must be a rational function. 

14 It is, of course, sufficient that a be the matrix of a positive definite quadratic 
form, since, if it is positive everywhere, it is clearly positive for the subset of values 
satisfying the m conditions of constraint. But this is not a necessary condition. The 
failure to recognize this was responsible for technical errors in economists’ treatment of 
utility theory up until Slutsky’s classic treatment of the subject. 

The maximum of 

JJ ss: * * * , Xn) 

subject to the side condition 

n 

£ Pm = i 

i 

does not require as secondary conditions that the matrix of the Hessian 

L<Piil 

be a negative definite form. In fact, the definiteness of this matrix is not invariant 
under a general monotonic transformation of the utility index of the form 

F = F(cp) 

Fi - F'w 

Fa - F'<pij + F”<pi<pi, 

whereas the correct conditions later to be developed are invariant. 
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As in the previous section, there is no loss of generality in con- 
sidering the behavior of Q on the unit hypersphere defined by 

£ - 1 = 0. (42) 

i 

This additional side condition is adjoined to the m equations of 
(41), and Q is to be minimized. Form the expression 

n n m 

F = IE a.Mi + 2 Z - W - D, (43) 

1 1 1 

where X and the /x’s are Lagrangean multipliers. At a minimum 
we must have 

n m 

Hi = Z 2 {anhi + Z FagC ~ X&O =0, (i = 1 , ■■■,’>%} (44) 

1 1 

or 

n m 

12 (CLiJ A 0|’y ) }l j “h Zt U<xgi a ”0, (i “ 1, * * ’, fl) 

1 1 

£ gfh = 0, (« = 1, •••,*») (45) 


z W - 1 = 0 . 


1 


A non-vanishing solution of these equations exists if, and only if, 
the following bordered determinant vanishes, 


A»(X) = 


&ij \d iS gi a 

gf o 

— ( — l) m A(X — Xi) (A — X 2 ) 


(X - X n _ m ) (46) 


where A > 0. The roots of this (n — m)th degree equation are all 
real because of the symmetry of the determinant. Let them be 
written in algebraic order 


Ai ~ X2 = ... ^ X„ — m . 

Then if the form is to be positive definite under the above con- 
straint, the smallest root, Xi, easily shown to be equal to the abso- 
lute minimum of Q under constraint, must be positive, and con- 
versely. Hence, it is necessary and sufficient in order for a form to 


FOUNDATIONS OF ECONOMIC ANALYSIS 


37 ? 

be positive definite that all roots of the above bordered determinant 
equation be positive.™ 

Recalling that the roots are all real and applying Descartes’ 
rule of signs, it can be shown as in the previous section that the 
following bordered principal minors must all be of sign ( — l) m . 


0 < (- l) m B r = (- l) m 


gi“ 


0 


(i,j = 1, • • *, r) 

(«,j8~ 1, ■••,») (47) 

m + 1 = r =§ n 


These ( n — m) conditions are both necessary and sufficient. 

If the form is to be negative definite under constraint, the de- 
terminants B t must alternate in sign, the first one being of sign 
(— l) m +\ the second of opposite sign, etc., i.e., 

(— 1 ) r B r >0, » + 1 S r = «. (48) 

The inverse of the bordered matrix 

It is of considerable importance for many economic problems 
to know what is implied with respect to the inverse or reciprocal 
of our matrix. The latter can be written in matrix notation 


O 11 

Om 

gl 1 ■ 




O n 1 

0*7111 

gm 

' ■ gm m 

- T— 

g 

gl 1 - * 

g n 1 

0 • 

0 

Lg' 

0 

. . K 

gn m 

0 • 

• 0 . 




The matrix has been partitioned into four submatrices; one is n 
by n, another m by n, still another n by m, and the last m by m. 
The matrix is symmetrical, i.e., 

R = R'. (50) 

ls The form is negative definite if, and only if, all are negative. If the roots differ 
in sign, it is indefinite. If one or more are zero, but the rest are of the same sign, it is 
semirdefinite. 
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The inverse or reciprocal matrix is defined by 
RR~ l = I = identity matrix. 

It can be written 


— 1 

[o^Q 

-n 

11 

r 

0^| 

ol 

|<buO 

_I 

«l 

L_ 


Performing the operation of matrix multiplication, we find 


r ad + gg' 

ag + gq~ 

= 

~I 

_0] 

L g'd + o 

g'g + o . 

.0 

I J 


= I. 
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( 51 ) 

( 52 ) 


This implies the following four matrix identities : 

ad + gg' = I, 

ag + gi = 0, " /e*\ 

g'd = 0, 
g'g = I- 

It can be shown that & is the matrix of a positive semi-definite form 
provided Q is positive definite under constraint. It is only of rank 
(n — m); i.e., all determinants of higher order than (« — m) vanish. 
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THE THEORY OF DIFFERENCE AND OTHER 
FUNCTIONAL EQUATIONS 

1. In the first mathematical appendix I have included a very- 
brief summary of such theorems as play an important role in eco- 
nomic analysis. However, it seems desirable to give a fuller treat- 
ment of difference equations and other functional equations. First, 
although in many ways the theory of simple difference equations 
is more, elementary than that of differential equations, it is not 
customarily taught in mathematics courses, and no convenient ex- 
positions of the subject from the standpoint of the economist are 
to be found. 1 Second, just as differential equations are of cardinal 
interest to the physicist, so difference equations are of the greatest 
importance to the dynamic economist, and hence their exposition 
deserves a prominent place. 

A Single Equation 

2. A difference equation can be written as an implicit relation 
of the following form 

f[y(t + n), y(t + n - 1), • ■ • , y(t), Q = 0. (1) 

This will be said to be of the nth order because it involves the 
variable y at n different (equally spaced !) periods of time. It may 
be noted that the fact that the y values are defined for t values 
differing by an integer does not really imply any restriction ; for 
if the equally spaced values differ by any quantity, we may redefine 
our time variable so as to make this difference equal to unity. 
Although the values of t differ by an integer, it is not necessary 
that t take on only integral values. However, for purposes of eco- 

1 The economist may consult with profit G. Boole, Treatise on the Calculus of Finite 
Differences (London, 1880); L. M. Milne-Thomson, The Calculus of Finite Differences 
(London, 1933); P. M. Batchelder, An Introduction to Linear Difference Equations 
(Cambridge, Massachusetts, 1927); N. E. Nor iund, Vorlesungen iiher Differ enz&nrechnung 
(Berlin, 1924) for a more advanced treatment and bibliography. 
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nomics it is usually only necessary to consider the values of y for 
integral values of time. By breaking up our time into small enough 
units we can keep the error involved in this procedure down to any 
requisite level. As we shall see, this greatly simplifies the mathe- 
matical theory and makes it resemble very closely the theory of 
differential equations. 

Equation (1) is an identity in t. The reader will see from this 
that it may be written in very many ways, e.g., as a single relation 
not between \jy(t -f- n), • • - , y (/)]], but as a single relation between 
Qy(£)i • • • , y(t — n)2> e tc. However, because equation (1) contains 
as its last factor the explicit term t, rewriting our difference equa- 
tion in the manner suggested will involve a slight modification of 
the form of the / function. Non-historical systems, which are of 
the greatest interest in dynamic economics, do not involve time 
explicitly and therefore omit the last t in equation (1). 

3. Difference equations receive their name because they involve 
the value of a variable at different points of time. As written in 
equation (1), however, there is no direct use made of the notion of 
finite differences A n y, defined in books 2 on interpolation as follows : 

Ay(t) = y(t + 1) - y(t), 

A 2 y(t) = A y(t -f 1) - A y(t) = {y{t + 2) - y{t -f- 1)} 

- \y(t + 1) - y{t ) } 

= y(.t + 2) - 2y(* + 1) +y(t), 

• ( 2 ) 

A n y(t) = A n ~ x y{t + 1) — A” -1 y(£) 

n 

A °y(t) = y(t). 

It will be intuitively clear that a knowledge of n adjacent values 
of y will permit us to calculate the value of y and of its first (n — 1) 
differences. (See equation 6 below.) Therefore, by substitution 
into equation (1) we can always write it in the following form: 

F[A»y(J) , A -"MO , •••, Ay (t) ,y(t),q = 0, (3) 

2 See E. T. Whittaker and G. Robinson, The Calculus of Observations (3rd ed., 
London: Blackie, 1940), chaps, i-iv. 


382 FOUNDATIONS OF ECONOMIC ANALYSIS 


where F is related to / as follows 

F[A»y, • • •, A y,y,Q= fly + nAy 

+ A 2 y + • • • + A Ay, ■ ■ ■ , y + Ay, y, f ]. (4) 

Not only can every difference equation like (1) be written as (3), 
but also conversely, because of the relations given in (2), every 
equation of form (3) can be written as form (1). Thus, the two 
forms are completely equivalent, and we can choose between them 
on the basis of convenience. Actually, the form of equation (1) is 
most frequently encountered in the literature, although later it will 
be shown how either may be handled. 

4. Their relationship to each other may be further revealed if 
we examine briefly the notion of an operator. Thus, A may be 
thought of as a difference operator defined by the first equation 
of (2). In a similar way we may define the differential operator 
D(— d/dt ) by the relation 

Dy(t) = y'{t), D 2 y(t) = By’ if) = y"(t), etc. 

In the last century the English mathematician Boole, following up 
the earlier fruitful suggestion of Leibnitz, employed the shifting 
operator E defined as follows 

Ey(t) = y(t -f- 1), E 2 y(t) = y(t + 2), etc. 

The success of the operator method stems from the recognition that 
these may be treated as if they were algebraic quantities. Thus, 
leaving out the y function completely, we may write out the 
operator identity 

E = 1 + A. (5) 


Treating these as algebraical expressions subject to the usual bi- 
nomial theorem we derive immediately the relations 


E n = (1 + A) n = 1 + nA + ^ A 2 + • • • 


+ 


n\ 


(n — i) \i\ 


A i + 


A" = (E — 1 )" = E n - nE n ~ x + ■ - ^- E n ~ 2 + • • • 


( 6 ) 


+ (- ty 


n\ 


(n — i) \i\ 


ftn-i _j_ 


Note the agreement of the second of these with equation (2), and 
the use of the first of these in connection with equation (4). 
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Exercises : 

1. The algebraic deviation of the price of hogs from the equilibrium value 
is in any year f the deviation of the previous year, but of opposite sign. Set 
up a difference equation for the price of hogs. Intuitively is the process stable 
or unstable? (Hint: Use as your variable the deviation of price from the equi- 
librium value.) Write out the first few terms of the sequence. 

2. Form a difference table of the numbers 2, 4, 8, 16, 32. How many 
difference columns can you fill in with the given numbers? What is the general 
rule relating the number of columns of the difference table which can be derived 
from a given set of numbers? Note the triangular form. 

3. Write the difference equation of the first exercise in the form of equation 
(3) rather than (1). 

4. Write the difference equation y(t + 2) — a(l + +* 1) + a&y{t) = 0 

in terms of the E notation and also in terms of the A notation. 

5. What is the meaning of E~ l ? 

6. The receding difference operator V is defined by Vy(/) = y(t) — * y{t — 1). 
Give its relationship to Er\ corresponding to the relations of equation (5). 

7. Write out the usual Taylor’s expansion for y(t +■ 1) in terms of y(t) 
using the D notation. Verify the relation e D = E. (Hint: Use the series 
expansion e a = 1 + x/ll + x 2 /2 ! + • • *.) What would seem to be meant by 
sine D? by 1/(1 — A)? 

8. What is the first difference of t 2 ? the second and third differences? What 
is the third difference of any second degree polynomial? What is the nth 
difference of any nth. degree polynomial? the ( n + l)th difference of an nth 
degree polynomial? 

9. Write out the difference equation satisfied by the numbers in exercise 2. 

10. Let us define ascending factorial polynomials as follows: x (n) = #(& — 1) 
■••(# — n + 1). Verify that Ax (n) = wx (n ~ l) . Of what does this remind you? 

Systems of Equations 

5. Up until now we have been concerned only with a single 
variable. In most applications to an economic system many vari- 
ables are involved. Hence, we must introduce the notion of a 
system of difference equations in m variables (y lf * •, y m ). These 
consist of a set of difference equations in the variables in question. 
Just as a static system can only hope to be determinate if it consists 
of as many equations as unknowns, a dynamic system must contain 
as many independent, consistent equations as there are unknowns. 
Thus, our system takes the form 

J^EyiCt + n)f + n), • , y m (t + 

yi{t)r * * m t y™(f ) » 0 = o, 


( 7 ) 


+ n), y*(t + n) t • - + n); • • * ; 

yi(t)t yz(t)t * * *> y*Q); 0 ^ 0 ? 
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As before, if desired this can be written in finite difference form by 
making use of the relationships between A and E. It is not neces- 
sary that each of the equations in the above set should involve the 
same number of lags; it is often convenient as a matter of notation 
to write each as if it contained the same number of lags as the 
greatest number, n, which occurs in any of the equations. From 
the discussion at the end of section 2 of this appendix it will be 
clear that we can shift our time reference point so that some 
yi(t + n ) appears in each equation. 

6. A difference equation system such as is defined in equation 
(7) can be thought of as an identity which holds for each value of t. 
It is equivalent then to an infinite number of implicit relations. 
This does not lead to overdeterminateness because each new time 
period defines as many new variables as there are equations. In 
fact, if we confine t to a finite number of consecutive values, we 
shall always find that we have less equations than we have un- 
knowns. Thus, the system is formally indeterminate unless we ad- 
join certain so-called “initial conditions” or “boundary conditions.” 

This may be illustrated by a sum of money growing at six per 
cent interest per annum. Its behavior over time follows a simple 
difference equation, but the process is completely unambiguous 
only if we specify how much money there was to begin with; or 
alternatively, if we specify to what value the principal will have 
grown by any given date. 

In the above example of a first order difference equation involv- 
ing a single variable only one initial condition can be arbitrarily 
specified. Therefore, one is said to be the order of the system. 
If we had a system consisting of two difference equations each of 
the third order (i.e., each involves three lags), then the number of 
initial values which can be arbitrarily prescribed would be seen to 
be two times three. In general, where there are m difference equa- 
tions in m variables, and each involves n lags, the number of arbi- 
trarily assignable initial conditions will be seen to be m times n, 
and this number will be termed the maximum order of the system. 3 

7. Adding initial conditions equal to the order of the system in 
effect supplies us with the missing equations discussed in the pre- 

3 In the interests of expositional convenience and uniformity I have departed from 
the usual mathematical custom of saying that in this case the order of the system is at 
most n times m. By adopting the convention described at the end of section 5 I have 
insured that the order will be exactly n times m. 
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vious section so that our unknowns no longer exceed the number of 
our relations. Actually, it is well known that a system of implicit 
equations equal to a specified number of variables may not de- 
termine a unique solution because of inconsistency, dependence, 
multiplicity of roots, etc. However, in any relevant, empirical 
economic system it is the task of the economist and not of the 
mathematician to specify an adequate number of relations to make 
the system determinate. Moreover, in causal dynamic systems it 
is natural to think of causation as taking place unilaterally in time, 
from past to present rather than from future to present. There- 
fore, it is convenient to concentrate upon the case where the speci- 
fied boundary conditions are truly initial conditions sufficient to 
start the system off, after which it determines its own behavior at 
each step. This means that the implicit equations of (7)»can be 
solved explicitly for values of yfit + n) as single-valued functions 
of “previous” y’s. 

y*(t + n) = J^iit + n — 1), • • • , y m (t + n — 1); • • • ; 

yiOO, • • • , yJfi ) ; 0- O' = 1, • • • , m) (8) 

Under these conditions, if we specify any initial conditions of 
the form 


yi(0) 

y 2 (0) • - • y m ( 0) 

_yi(n - 1) 

yffn - 1) • • • y m {n — 1) . 


then we can clearly determine from equations (8) the quantities 
, y m {n)~\. Adding these to the previously prescribed 
variables, we are in a position to determine the y’s at the next 
instant of time, and so forth ad infinitum. In other words, by 
the use of the familiar principle of mathematical induction we can 
state an important existence and uniqueness theorem: a system of 
difference equations of the type indicated in ( 8 ), subject to prescribed 
initial conditions as given in (P), possesses a solution for all positive 
values of t. The solution is unique. 

This theorem is valuable, not because it tells us how to find a 
convenient solution, but because it assures us that once we have 
found a solution, and verified it by substitution in the difference 
equation and initial conditions, then we can rest assured that it is 
the only possible solution. 
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8. In this section it will be shown that any system of order S, 
whatever be the values of m and n, can be replaced by a system of 
5 first order difference equations in S variables. This reduction to 
what may be called normal form is always possible by means of 
redefining 5 = nm new variables, Yu, • • •, Y nm , as follows: 

Y u = Y t 

Yn — EYi 

0‘ = 1, • • •, ra) (10) 

Y ni = E n ~ l Yi. 

In terms of the new variables the reader can show that the system 
defined by (8) becomes 

EY U = Y 2i 
EYu = Y zi 

(*«!,•••,») (11) 


EYn^i = Yni 

E Y n i = J\ 


Our initial conditions are written simply as 


lYrton = cto - 1)]. 


( 12 ) 


By way of illustration let us consider a system consisting of 
one second order equation in one variable. 


y(t + 2) + aiy(t + 1) + a 2 y(/) = sin t ; y(0) = b 0 , y(l) = bi. (13) 


The order of this system is clearly two. Defining new variables 
as follows 

Yi(t) = y{t), 

Yt{S) = Ey(t), (14) 

we can write the above system in the normal form 


E Yi(t) — diY — a,Y t (t) d - sin/, 
EY 1 (t) = Yi{t), 

TO) = bo, 

Yt( 0 ) = bu 


(IS) 


The reader should work out several examples, such as that of 
exercise 4, section 6 above. 
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It is to be noted that equations (7) can by a similar definition 
of variables be written as S first order implicit equations. Indeed, 
an indeterminate system, involving more unknowns than equa- 
tions, can always be converted into an (equally indeterminate) 
first order system. 

9. We have seen that a single high order equation can be trans- 
formed into a number of first order equations. Is the reverse 
possible? Can a first order system (and, therefore, any system) 
be transformed into a single equation, whose maximum order is 
equal to the maximum order of the normal system? Usually the 
answer is yes, although the reduction is easier to indicate than 
actually to perform, and in singular cases, where certain conditions 
on the partial derivatives of the f’s fail, the reduction may be im- 
possible. Also, a well-determined explicit system may after reduc- 
tion be expressible only as an ambiguous implicit high order relation 
in one variable. Thus, the well-determined system 

Ey 2 = yf; Ey i = y 2 + yi (16) 

can be reduced to the well-determined second order equation in yi 

E 2 y i - Eyi = yi 1 , (17) 

but in yt we get only the ambiguous implicit equation 

(E 2 y 2 - Eyi - y 2 2 ) 2 = 4 y^Ey,.. (18) 

Let us suppose that we have a normal system of order S, 
yi(t -f- 1) = • • • , y,(t), t], {i = 1, • • • , n) (19) 

or, in abbreviated notation, 

EY = /(F, t). 

It will necessarily involve 2S variables, namely the y’s at two in- 
stants of time. Given S equations in 2 S variables, it is not possible 
to eliminate all but one of the y’s and be left with a difference 
equation involving only that y. If we write down the identities 

E 2 Y = J(EY, t + 1) 

E*Y - J(E 2 Y, t + 2) 

( 20 ) 


E’Y = J{E‘~ 1 Y, t + S- 1), 
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we shall have introduced as many new variables as we have equa- 
tions. But not all of these need be eliminated ; only those involv- 
ing all variables except the one in question. All together we 
have S 2 equations in 5 2 + A variables, and we seek to eliminate 
(S — 1)(S + 1) variables. Our number of equations exceeds this 
by one, so we can hope to perform the elimination, the remaining 
equation being the desired S order equation in a single variable. 4 

10. Everything said about difference equations applies with 
very little modification to differential equations. The reader 
should write out a single differential equation corresponding to the 
single difference equation (1) by writing out the latter using the 
operator E, and then substituting for E the operator D. Simi- 
larly, he should write out the system of equations (7) in terms of E, 
and then derive a system of differential equations by the substitu- 
tion of D for E. The same applies to initial conditions and to the 
notion of the order of a system. The argument showing how a 
system of equations can always be reduced to first order normal 
form also applies directly to differential equations. He may check 
his reasoning by referring to the first chapter of the Moulton book 
just cited. 

Exercises: 

1. The national income of each of two countries equals domestic investment 
plus domestic consumers’ goods production plus exports. The consumption 
and imports of each country are respectively determinate functions of income 
in that country in the previous period. Set up the system of difference equa- 
tions describing the behavior of each national income over time. Specialize 
these by assuming constant marginal propensities to consume and import. 

2. Use the 45° line graphical method of chapter x to analyze whether or 
not the equation, Yt+i 2 -f Y t 2 = 9, defines a unique causal system. If neces- 
sary, modify the definition of the equation slightly so as to remove all am- 
biguity. Can the initial value, F 0 , be assigned completely at pleasure? (Hint: 
try a few arithmetical examples.) 

3. The system, Ey(t) = s(2); Ez(t) = .5 y{t) + 2 z(t), was converted to 
normal form from a single equation in y. What was the exact form of that 
equation, and what definitions were used in converting it to normal form? 

4. Given the system, Ey\ = y 2 ; Ey 2 — yi, derive a second order equation 
in each of the variables alone. 

4 The reader may consult F. R. Moulton, Differential Equations (New York, 1930), 
pp. 6-9, for a more complete discussion of this problem taken with special reference to 
differential equations. For linear equations this is worked out in some detail in P. A. 
Samuelson, “A Method of Determining Explicitly the Coefficients of the Characteristic 
Equation,” The Annals of Mathematical Statistics , XIII (1942), 424-429. 
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5 . A body falling in the earth’s gravitational field satisfies the equation, 
(Px/dt 2 = — g. Convert into norma! form. 

6. Write out a system of difference equations in A form. Define the con- 
cepts of order, normal form, etc. 

11. If only numerical answers are desired, one may simply use 
the difference equation at each instant of time to calculate suc- 
cessively the new values of the variables, and so forth for as long a 
time interval as is desired. However, this may be tedious in prac- 
tice, and inevitably there must be rounding off at each stage so 
that the accuracy of the solution deteriorates with its length. (For 
a stable system if the errors of rounding off remain within a certain 
limit, the errors of the solution will remain bounded.) 

The method of actual numerical examples has had considerable 
use in economics, and is valuable from the standpoint of providing 
illustrations. 5 Still one can never be sure that another arithmetical 
example might not contradict the principles inferred from this one, 
so that the method is of limited usefulness even for the purpose of 
providing qualitative information concerning solutions. After all, 
life is not long enough for us to complete even one sequence, nor 
to try all possible initial conditions, nor to vary the form of the 
structural relations. 

Unfortunately, if we simply consider difference equations in all 
generality, there is no simple analytical way to write out explicit 
solutions. The same is true of differential equations. The be- 
ginning student may be taught a variety of ad hoc tricks which may 
work on particular equations, but the advanced student must rest 
content with his existence theorem which tells him that there exists 
a unique solution; often the proof of the existence theorem tells 
him how he may (painfully) approximate to that solution. 

In both of these lines much more complete results may be 
achieved if we restrict ourselves to linear systems. That this may 
be a serious limitation, especially for business cycle purposes, is 
shown in chapter xi. However, in chapter x it is shown how the 
assumption of linearity is not seriously limiting if we are primarily 
interested in stability, especially in the small. 

In much of what follows, therefore, we shall confine our atten- 
tion to systems which have two properties. First, they are to be 
linear. There is the added advantage that most mathematical 

6 E. Lundberg, Studies in the Theory of Economic Expansion (London, 1937). 


390 


FOUNDATIONS OF ECONOMIC ANALYSIS 


expositions are concerned with these. Second, they are not to in- 
volve time explicitly. Most economic systems of interest are 
causal rather than historical in nature and satisfy this require- 
ment, thereby simplifying our task and enhancing mathematical 
tractibility. 

In fact, if we insist that a system be timeless and at the same 
time linear, we are left with the exceedingly simple case of constant 
coefficients, whose theory is perfectly complete and elementary. 
We shall see that exactly the same thing is true of differential equa- 
tions so that one who has mastered either of these branches has 
little difficulty with the other. From a more fundamental point of 
view, each of these is simply the exemplification of a more general 
theory of linear operational equations. 

12. There is, however, one non-linear case where a more or less 
complete solution can. always be indicated. That is the case of the 
first order equation of the form 

yt + 1 = ( 21 ) 

The 45° diagram of chapter x pictures the various types of equi- 
librium solutions, y t = a, with the property, a — f(a) = 0. It is 
also clear from that diagram that a motion which remains bounded 
need not approach an equilibrium position, but may approach a 
periodic motion (one of the boxes of the cobweb theorem). Some 
of these boxes are stable and some are unstable. We may test the 
stability of a motion which repeats itself every S periods by the 
method which Professor Leontief 6 has suggested. Let us define 

Mx) = fix) 

Mx) = /[/(*)] 

( 22 ) 

Mx) = /[/ s -i(x)] = / [/{/ s _ 2 (x)j]etc. 

It is then true that the solution of equation (21) at any time t, 
for given initial condition, y 0 , is given by 

y(t)=Uyo). (23) 

6 W. Leontief, “Verzogerte Angebotsanpassung und Partielles Gleichgewicht,” 
Zeitschrift fur Nationalokonomie , Band V (1934). 
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However, this is more art indication of a solution than a solution 
itself since the /’ s in question may be very complicated functions. 

Suppose that (ai, a 2 , • • • , ai) represents a periodic solution of 
the original equation. This will be true if, and only if, the new 
difference equation 

y«+i = Myt) (24) 

is such that 

Mai) = (i = 1, ■ • •, s) (25) 

The motion will be stable if, and only if, the above equilibrium 
points of the new single difference equations are stable. This can 
be tested by means of the methods developed in chapter x. 

Exercise : 

Show that the stability of the periodic motion depends upon 

l/'(oi)/'(a 2 ) •••/'(«,) IS 1. 

[Hint: What is /.'(a,-) equal to?] 

13. The case of the first order differential equation which does 
not involve time explicitly, 

dy/dt = f(y), (26) 

is one which admits of an immediate solution by quadrature, as 
indicated in chapter x. Its richness of solution is limited, every 
motion being monotonic. Hence, if bounded, the motion must 
approach an equilibrium point. There may, of course, be a number 
of equilibrium points alternately stable and unstable. 

Exercises: 

1. Let f(x) = x\ What is the general expression for/t(x)? 

2. Show that every equilibrium value of /(*) is an equilibrium value of 
ft(x). Is the converse true? What about the stability of the common equi- 
librium points, will they agree? 

Linear Systems 

14. A linear system is one which involves the dependent vari- 
ables in expressions of the first degree, i.e., without higher powers, 
cross-products, nonelementary functions, etc. For the purpose of 
this section it is not necessary that time be excluded or that it enter 
the problem linearly. We may write our linear system in the form 

Lh>m=M> ( 27 ) 
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where L stands for any general operation upon a whole system of 
variables, and / represents a known function or set of functions. 
For the discussion of this section it is not even necessary that L 
represent difference equations rather than differential equations. 
Rather, it may represent an implicit functional of y over the whole 
t interval, and in the notation of chapter ix, page 261, it could be 
written as 

+03 

L[y(t)~] = F[_y(r ) ; f]. 

—co 

The fundamental linear property which L must satisfy is as 
follows : 

L(y i + y,) = L( yi ) + L(y a ), (28) 

from which follows immediately the general relation, 

L(ciiyi + + • • • + a„y n ) 

= aiL(yi) + aiL(yf) + • ■ • + a n L(y n ). (29) 

From this general definition of a linear system there follow im- 
mediately two important general theorems. 

Superposition theorem : The solution of (27) when f is made up 
of the sum of two different fs is equal to the sum of the solutions ap- 
propriate to each. Mathematically, this may be written in slightly 
more general form : 

L(y t ) = fi implies L( £ aiyf) — E affi- (30) 

The proof follows immediately from (29). 

This means that the solution of a system with many forces 
acting on it is equal to the sum of the separate effects of each. 
Even where this is not strictly true, the economic situation may 
approximate to this condition. 

For many linear systems it may be possible to give the solution 
for a fundamental set of f’s into which any / can be decomposed. 
If this is the case, our problem is immediately solvable. 

Our second theorem states: The difference between any two solu- 
tions of (27) must satisfy the reduced (or homogeneous) equation 

L(y) = 0. (31) 

This follows immediately if we substitute each of the solutions in 
(27), and then subtract one expression from the other. The right- 
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hand function will cancel out, and according to (29) we may regroup 
the terms on the left-hand side to achieve the desired result. 

A most important corollary of this theorem is derived simply 
by rearranging terms. The most general solution (i.e., any solution) 
of (20) may be written as the sum of some one particular solution and 
the general solution of the reduced equation (31). 

This permits us to concentrate upon the more easily manageable 
reduced equation in order to determine the uniqueness, multi- 
plicity, boundary conditions, etc., of the solutions. It is often 
possible by ingenuity, good luck, or intuition to arrive at a par- 
ticular solution of the unreduced case, and the present corollary 
tells us how to transform it into the general solution, if only we can 
find the general solution of the unreduced case. 

Exercises : 

1. Except that there are no boundary conditions, show that the above 
remarks apply to a single linear algebraic equation of the form, ax = b. Show 
that when the reduced equation, ax = 0, has but a single solution, the full 
equation has but a single solution. Can the converse be stated? Can you 
state that the uniqueness of either equation guarantees the uniqueness of the 
other? (Hint: Distinguish three possible cases, neither a nor b equal to zero, 
both equal to zero, only one equal to zero.) 

2. Consider two (or more) linear algebraic equations. Show that unique- 
ness of either the reduced or unreduced equations implies uniqueness of the 
other. Show, however, that the reduced equation can have many solutions, 
when the unreduced has none. Enumerate all the possibilities, and with the 
aid of a textbook on algebra show how one can recognize each case. (Hint: 
The terminology of matrices and determinants will be found useful.) 

3. Show that the following integral, and mixed difference-differential 
equations are respectively linear: 

y(t) + k(t - a)y(a)da * /(/), 

y'(t) + ay(t - 1) — f(t). 

Do they involve time explicitly? 

15. In view of the importance of the reduced or homogeneous 
system of the form 

L(y) = 0, (32) 

we shall devote a number of sections to it. Most of our analysis 
will apply to systems of equations, but occasionally it will be de- 
sirable to confine attention to a single high order equation in one 
variable. Where difference equations are concerned, this may be 
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written in the form 

a 0 (t)y(t + n) + ai(t)y(t + « — !) + ••• + a n (t)y(t ) = 0. (33) 

Much of the mathematical theory of difference equations is 
concerned with the case where the a’s are simple polynomials or 
analytic functions, just as much of the classical theory of ordinary 
differential equations is concerned with the case 

a»(t)y n {t) + ffli (t)y n ~ 1 (t) + • • • + o, n (t)y(t) — 0, (34) 

where the a’s are of the same simple type. 

However, if we insist that the economic system be nonhistorical, 
i.e., independent of calendar time, and if we also insist that it be 
linear, then the a’s must be constants, and we have the case of 
constant coefficients. 

16. Whether or not time enters explicitly, it is easy to derive 
from equation (29) the theorem : A linear combination of a number 
of solutions of the reduced equation (32) is itself a solution; i.e., 

L(ciyi + cty 2 + • • • + c s y s ) =0 if L(y ; ) = 0. (35) 

Since the difference between two solutions is a linear combina- 
tion, we have the immediate corollary: 

L( 0) = 0. (36) 

If the y stands for a whole set of variables, the zero must of course 
stand for a whole set of variables equal to zero. 

Usually, in the analysis of differential equations and difference 
equations as well, the mathematician looks for a basic set of special 
solutions in terms of which any solution can be written by linear 
combination. This linear combination, written with undetermined 
coefficients, is called the general solution of the reduced system. 

If there is an existence theorem which guarantees a unique solu- 
tion of the reduced system satisfying certain initial conditions, we 
should of course expect the number of undetermined constants in 
the general solution to be equal to the number of arbitrarily as- 
signable, initial or boundary conditions, or equal to the order of 
the system, S. 

The reader should note that not every set of 5 distinct special 
solutions of the reduced system will form the basis for a general 
solution. Thus, if one or more of the given 5 solutions can be 
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written identically in t as a linear combination of the others, the 
set of particular solutions is said to be linearly dependent and cannot 
form a basis for the general solution. For a linear combination of 
all S solutions can in this case be regarded as a linear combination 
of less than S linearly independent solutions; the number of con- 
stants in such an expression is necessarily less than S, and hence 
we cannot hope to satisfy S arbitrary initial conditions. 

17. However, the reverse is not necessarily true. If we find S 
solutions which are linearly independent (i.e., which are not linearly 
dependent), it does not follow from our existence theorem that they 
form a basis for the general solution. Indeed, there may exist no 
general solution in the sense in which we have been using the word. 
This is perhaps inadequately emphasized in the mathematical texts 
because of their preoccupation with the special case of differential 
equations where a stronger theorem can be enunciated. Thus, 

D'-y{t) = 0 (37) 


has linearly independent solutions (1,0 which form a basis for a 
general solution, in terms of which any solution can be written. 
But 


o 

il 

GJ 


(38) 

has two linearly independent solutions, 



yi(0 = 1, t = 0; y 2 (t) = 1, 

t = 1, 

(39) 

yiOO =0, t rn 0; y 2 (0 = 0, 

t s l, 

which do not provide such a basis. For the function 


y s (t) = 1, 1 = 2, 

y*(0 = 0 , t £2, 


(40) 


is a solution of (38) for appropriate initial conditions, and yet it 
cannot be written as a linear combination of the two linearly 
independent solutions. 7 

7 It may be objected that ys{t) does not satisfy the difference equation (33) for all 
values of t, particularly for i equal to zero. However, a solution is to be thought of as 
satisfying a difference equation if for given arbitrary initial conditions it is in accord with 
the prescribed law. To help clear up the present paradox, it should be pointed out that 
solutions which are linearly independent on one interval of t may be linearly dependent 
on another interval. It is also to be pointed out that the example cited is one in which 
the solution of the difference equation is unique to the right of the initial conditions but 
not to the left. It is also one in which the irreversibility in time disappears if the differ- 
ence equation is replaced by an “equivalent” one of lower order. 
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The absence of a general solution does not mean that we cannot 
find the solution, for given initial conditions. Our existence 
theorem guarantees us that we can find such a solution. Actually, 
such a solution can be written as a linear combination of basic 
solutions. However, the above example shows us that the linearly 
independent basic solutions may have to be different sets for every 
initial point ; there need not exist a set of basic solutions which will 
serve uniformly for all initial times. 

Linear Operator Equations with Constant 
Coefficients 

18. I turn now to the case of difference or differential equations 
with constant coefficients. Since it will be shown later that the 
solution for equations of any order can be written in terms of 
solutions of the first order case, I shall consider this simplest of all 
cases. In the first order case our difference equation becomes 

(E - q)y(t) = fit), 

yito) = b, 

and the corresponding differential equation becomes 

(D-q)y(t) =f{t), 
y{to ) = b. 

Each of these may be considered as special exemplifications of 
a general first order equation in a simple operator, h, 

(h - q)y(t) = f(t), 
y{to) = b. 


(41) 


(42) 


(43) 


For h to be an admissible operator it is only necessary that there 
be an existence theorem guaranteeing that the above equation has 
a unique solution for the arbitrarily specified initial condition. 
How this existence theorem is established need not concern us at 
this point; we have seen how it can be done for difference equations, 
and the reader may refer to any work on differential equations for 
the corresponding proof. In addition to E and D, examples of h 
are A, tD, E + 5, etc. On the other hand, such an operator as D 2 
would not be an admissible h, because the system corresponding 
to (43) would not have a unique solution. This is because two 
initial conditions are necessary, rather than one, to satisfy a second 
order differential equation. 
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We may suppose the solution of the above system to be 
written as a linear functional of /, with q and b as parameters, or as 
Fit, q, b; f)k, where F has the fundamental linear property 

Fit, q, c x bi + • • • + cA; cji + ■•••+ c s f s ) h 

= c i F(f, q, b\; fi) h + • • • + c s F(t, q, b s ; f s )h- (44) 

From this fact it is easy to verify that h is a linear operator 
with the properties 

K c Oh + c-yyf) = cjiyi + cjiy 2 , 
h-y — h(liy), 

( 45 ) 

h n y = hh n ~ l y , 
h°y = ly, 
hO = 0, 

P(h)Q(h)y = Q(h)P(h)y, 

where P and Q are polynomials. 

19. So far we have not defined h raised to a negative power. 
It would be natural to define 


y = h-W) 

as the solution of 

b = fit)- 

However, it is clear from our existence theorem that this cannot 
have an unambiguous meaning unless we specify some initial con- 
dition. This corresponds to the familiar indeterminacy of the in- 
definite integral of a function, D~ l f. We need a constant of inte- 
gration or a definite lower limit of integration before there can be 
a determinate answer. We shall not have occasion to use negative 
powers of h even though readers familiar with the Heaviside opera- 
tional methods will realize the mnemonic advantages to be derived 
from such a procedure. These advantages are counterbalanced by 
the aura of mystery which surrounds some of the usual expositions 
of this method. In general, the methods here outlined will be of 
“classical type,” albeit in somewhat amplified form. It is the 
present writer’s conviction, which need not be sustained here, that 
a careful examination of the Heaviside-Cauchy operational calcu- 
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Ius, of Bromwich-Wagner contour integrals, of Laplace transforms, 
etc., will show that where these differ from the classical methods 
the advantages are in favor of the latter. These remarks are in- 
tended to apply to ordinary differential equations of finite order 
with constant coefficients and do not refer to the use of the Laplace 
transform in connection with partial differential equations, nor in 
connection with integral equations of faltung type, etc. 8 

20. Being first concerned with the reduced equation, where f(t) 
vanishes, we concentrate upon solutions of the form Fit, q, b; 0) A . 
Because of the linear properties of h given in (45), this can be 
written in the special form bF(t, q, 1 ; 0)a. Hence, peculiar impor- 
tance attaches to the solution of the reduced first order equation 
with initial condition equal to unity. Let us determine, there- 
fore, the exact form of the F function for h equal to E and for h 
equal to D. 

Clearly if 

Ey = qy, y(t 0 ) = 1, 

then 

y(t 0 + 1) = qy(.t 0 ) = q, 


(46) 


y(t 0 + t) - qy(t 0 + t - 1) = qqy( h + t - 2) = • • • = q\ 

y (t) = qt~t o, 

or 

Fit, q, l;0)i = q l ~ l \ (t ^ t 0 ) 

Of course, the final proof of the above relation is to be found 
in the fact that when substituted into the difference equation, 
satisfies it and the boundary condition in question. Our 
existence theorem guarantees that it is the only solution. 

We may rewrite the differential equation (42) in the form 

dy/y = qdt. (47) 

Integrating each side, taking antilogs and inserting the appropriate 
constant of integration so as to satisfy the initial condition of unity 
at t equal to to, we easily find the solution to be 

F(t, q, 1 ; 0)b = > ^ 0 ) ( 48 ) 


8 G. Doetch, Theorie und Anwzndung der Laplace Transformation (Berlin, 1937). 
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The reader may verify that the operators A and tD have re- 
spectively the fundamental solutions 

Fit, 2, 1;0) A = (1 + qy~\ (t is /„) (49) 

and 

Fit, q, 1 ; 0) tB = (t/t 0 y. it ^ to) (50) 

Thus far, it has been implicitly assumed that q is a real vari- 
able. However, no detail of the procedure or proof would be es- 
sentially changed if q were assumed to be a complex number of the 
form (a + ib). It may be argued that no real economic process 
need be described in terms of complex numbers. This can be 
readily admitted. However, when we come to equations of higher 
order, it will be shown that the solution can always be written in 
terms of purely algebraic combinations of the roots of higher order 
polynomials. In order that every polynomial have exactly as 
many roots as its degree, it is necessary to introduce complex 
numbers. Because these occur in conjugate pairs, it will always 
be possible to regroup terms at the end and express everything in 
terms of real numbers. But it is most convenient to work with 
complex numbers in the intermediate steps. 

21. We may now inquire into the stability of this fundamental 
first order solution. It is clear that the initial condition enters only 
as a scale factor and cannot affect the problem of stability. Under 
what conditions will 

lim Fit, q, 1 ; 0)a = 0? (51) 

t—*<x 

If we examine each of the four specified F’s corresponding to the 
four distinct operators, E, D, A, tD, it is clear that we have the 
following respective conditions : 

E: absolute value of root must be less than one, 
or |g| < 1; 

D: real part of q must be negative, or R{q) <0; 

A: absolute value of (1 + q) must be less than one, 
or 1 1 + g I < 1 ; 

tD : real part of q must be negative, or R(q) < 0. 

In the complex plane the first of these is interpreted to mean 
that 2 must be inside the unit circle so that its absolute value or 
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modulus will be less than unity. For the differential system the 
value of q must be in the left half of the complex plane. Later it 
will be shown that the stability of the reduced equation determines 
the stability of the system. 

22. Having introduced a basic solution of the reduced equation, 
F(t, q, 1 ; 0) h , I now introduce a basic special solution of the un- 
reduced equation. By combining these two, the solution to the 
general first order system (43) can be written. Our special solution 
of the unreduced equation is that for which the initial condition, 
b, is zero. In accordance with our above notation, this can be 
written as F(t, q, 0 ; /)*. How it is to be found for any given oper- 
ator need not concern us here; our existence theorem assures us 
that it exists, and often suggests how it is to be found. 

The reader can verify that the following do represent basic 
special solutions for the respective simple operators. 



0 


etc. 


Because of our existence theorem, no proof other than verification 
is necessary. 

Consider now the expression 


y{t) = F(t, q, 0;f)h + bF(t, q, 1 ; 0) A . (53) 

Substitution shows that it is a solution of the general equation (43). 
Its initial value is 

y(t 0 ) = F(t 0 , 2 ,0;/) a + bF(to, 2 , 1 ; 0+ = 0 + b, (54) 

satisfying the correct initial condition, and thereby constituting 
the unique solution. 

23. We shall show that the most general system of any order 
has a solution consisting of two parts : a special particular solution 
of the reduced equation with vanishing initial conditions, and a 
“general” solution of the reduced equation with assignable con- 
stants sufficient to satisfy the prescribed initial conditions. 
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In fact, once the first order case has been thoroughly analyzed, 
the general case yields to a purely algebraic treatment. In what 
follows, I shall try to bring to a completion the Heaviside program 
of “algebrizing” the treatment of simple operators. As far as I 
know, the mathematical literature, because of its principal pre- 
occupation with special operators, stops short of such a complete 
program ; at some stage the concept of integration or summation is 
introduced into the discussion. Moreover, the present fashion of 
thought introduces transcendental notions such as the Laplace 
transform even in the simplest cases of ordinary differential equa- 
tions with constant coefficients. It will appear from what follows 
that the Laplace transform, however useful it may be to one who 
has grown accustomed to it, is neither necessary for rigor nor 
necessary for analytical and computational short cuts. In a 
fundamental sense it is an artifice which “cancels out of the prob- 
lem,” and aesthetically it represents an alien intrusion having no 
“natural” connection with the problem under discussion. Further- 
more, the present treatment may throw light on the connection 
between so-called “classical” and “operational” methods, showing 
their complete equivalence and identity. Aside from the lament- 
able aura of mystery surrounding many discussions of operational 
methods, the literature abounds with diverse, contradictory claims 
and counterclaims as to the advantages of the alternative methods. 

Despite the facts that simple operators are being discussed and 
that an algebraic treatment is being used, the present treatment is 
essentially classical, with one important addition: full use is made 
of the specialized algebraic character of the equations determining 
solution constants. When so amplified, the classical methods not 
only arrive at the same results as the operational methods, but 
actually go through the same intermediate steps. Wherever the 
two methods diverge, the advantage lies with the classical. For 
those who must have their Laplace transform, the concept is suit- 
ably generalized to any operator in section 29. 

24. We may begin by enunciating what is a theorem and not 
an assumption : If h is a simple operator such that the system 

(h - q)y = fit), (t S t 0 ) / 5S ) 

y(t 0 ) = b V 

has a unique solution for suitably limited functions f, and for q an 
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arbitrary complex number, then the general nth order equation 

Pn(h)y = h n y + aji n ~ l y + a^h^y + • • • + a n -ihy + a n y = /(/), 
y(t 0 ) = b 0 , 
b(t)lt=i 0 = b u 


(56) 


h n ~ 1 y(t)~]t~t a = b n - 1, 9 

has a solution, and the solution is unique. 

To prove the first part, I shall specify one solution, leaving its 
uniqueness to later discussion. First, let us note the algebraic fact 
that every »th degree polynomial has exactly n roots if we count 
each with its appropriate multiplicity. If g; denotes the ith. root, 
we may write our polynomial in the form 

P n (x ) - (x — qi)(x - qi) • • • (x — q n ). (57) 

Let us define n new polynomials as follows: 

Pn-l(x) = P n {x)/(x — qi) = E a.-.n-i*"- 1 - 1 ', 

0 


P n — P n — l(x) / (.X qi) 


n — 2 

Z Cli,n-2X n ~ 


0 


JP T (x) — JP / {x ^n—r) @'i i rX r \ 

0 


(58) 


Pi(*) = Pz{x)/(x - 2n-i) = Z = x - q n 


P 0 (x) == Pi(x)/(x - g„) = 1. 

9 For brevity we shall write as h*y(h), where there is no possibility of 

confusion. 
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Also, let us define some new functions as follows: 

Z 0 (t) = f(t), 

(Ji qi)Zi(t) = Zo(t), 

(h - g 2 )Z 2 (f) = Z 1 (t), 

(59) 

(h - q n )Z n (t ) = Z n ^{t). 

By our hypothesis on a first order system, each of these is deter- 
minable once we specify initial values [Zi{k), Z 2 (t 0 ), • • •, (/»)]. 

In a moment these will be prescribed. 

By virtue of the definitions of Z k (t) in (59), it will be clear that 

m = z 0 (t) 

— (h — q l )Z l (t) 

- {h - qi)(h-q 2 )Zi(t) (60) 

= (h - gi) • • • (h - q n )Z n (t). 

Thus, Z n (t) is a solution of our equation, and it remains only to 
make certain that we pick the Zi(t 0 ) properly so as to yield the right 
initial conditions. 

It is easily verified that 

Z n -,{t) = Pj(h)Z n (t). (61) 

Evaluating this expression at the initial time we have 

Ti cii.jh 3 ' ^Z n {t 0) = y) — Z n -i(t 0 ) . 

0 0 

O'- 0,1,2, — 1) (62) 

These are sufficient equations to determine the right-hand terms 
for a given set of V s. 

To recapitulate our construction: find the roots of the poly- 
nomial P ; by synthetic division or otherwise define the n new 
polynomials, Pc, evaluate Zj(t 0 ) in terms of the b’s; by repeated use 
of the methods of the last section, equation (54), solve in turn for 
Zi(i), Z 2 (t), • • • , Z n (t), using the initial conditions just determined; 
the resulting Z n (t) is the required solution. This proves the first 
part of our theorem. 
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To prove uniqueness, we shall assume that there are two dis- 
tinct solutions and show that this leads to a contradiction. Clearly 
the difference between any two solutions of (56) will satisfy the 
reduced equation with initial conditions all equal to zero. We 
have only to show that the reduced equation with zero initial con- 
ditions cannot have a solution which is not identically zero. For 
the first order case our existence theorem insures that there can 
be no other solution than zero. We shall proceed by induction to 
show that if a reduced equation of order (n — 1) has a unique zero 
solution for zero initial conditions, then so must an nth order 
equation. 

Let us suppose that the (n - l)th order equation has a unique 
vanishing solution and the nth order has a non- vanishing solution, 
Y(t). Then 

{P(h)Y\ = (h - qi){P n —i(h) Y\ = (h — q x )Z = 0, (63) 

where 

¥Y(t „) = 0, (i = 1, •••,» - 1) 

Y(t) ^ 0, 

and Z is defined by this equation. Our hypothesis on the (n — l)th 
order equation’s uniqueness tells us that Z must be nonvanishing, 
since otherwise our non vanishing Y would satisfy the (n — l)th 
order equation. On the other hand, the zero values of the n initial 
conditions of Y make Z(t 0 ) be zero. With an initial value of zero, 
and satisfying a first order equation, Z must vanish identically. 
Thus, we have a contradiction, and our conclusion concerning the 
uniqueness of our solution is proved. 

It is to be noted that the case of repeated roots in the character- 
istic polynomial P introduces no difficulties in the problem. Also, 
it is to be observed that the method of finding a solution outlined 
above is not necessarily a good one to follow in practical numerical 
computation. However, since it depends only upon repeated solu- 
tion of first order equations, it can be readily mechanized or reduced 
to a routine if some suitable method of solving first order equations 
is at hand (e.g., differential analyzer, punch card equipment, etc.). 

Exercises : 

1. Write out the solution for a reduced second order differential equation 
with constant coefficients which has 3/(0) = 1, y'(0) = 0; which has ^(0) = 0, 

y'C o) « l- 
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2. Write out the solution of the unreduced equation with vanishing initial 
conditions. 

3. Repeat (1) and (2) for any operator h, using the F function. 

25. Having established the existence and uniqueness of the 
solution of the general operator equation of the rath order with 
prescribed initial conditions, we may explore better ways of ar- 
riving at its solution. Our work may be divided into two parts: 
finding a particular solution with vanishing initial conditions, and 
finding a solution of the reduced equation which satisfies the pre- 
scribed initial conditions. Clearly, there are other ways of splitting 
up the problem according to the properties the particular solution 
is to have. Thus, in many physics problems the right-hand func- 
tion, /(f), represents a periodic driving force, and the particular 
solution sought is the so-called steady state response of the system, 
while the unreduced equation solution yields the “transient,” which 
dies out in a damped system. 

The unreduced equation involves no arbitrary functions and is 
simpler. We discuss it first. Its solution for the most general 
case can be built up out of first order solutions, F(t, q, 1 ; 0)^. For 
brevity we shall write the latter as F(t, q). Some of its properties 
may now be developed. 

First, since 

F(k,q) = 1, (64) 

~F(t 0) q)^0. (65) 

Also, by its definition as a solution of the reduced first order 
equation, 

hF = qF, 

h 2 F = q 2 F, 

( 66 ) 

h n F = q n F, 

and 

P(h)F = P{q)F. (67) 

Because of the linearity of our system, we have 

p(h) r y(*,g«) -y(f,gi) 1 = P(h)y(t, g2 )-P(ft)y(f, gQ 

L 2a ~~ 2i J 2s ““ 
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If y is a continuously differentiable function of the parameter q, 
we find in the limit 

p v>%-ik p wy- < 69 > 

For the moment we disregard all boundary conditions and seek 
solutions of the reduced equation 

P(h)y = 0. 

First we establish a lemma: If q is a root of multiplicity s of 


so that 


then 


P(a) = o, 


P*(3) = 0, 

P*(q) ^ 0, 


(* = 0 , 1 , •••,$ - 1 ) 


P(h)~F(t,q) =0. (k = 0,1, .--,5 - 1) 

where the last expression is short for [ [P(h ) ^ Fit, q)~]q=.q 
Actually 


(70) 

(71) 


P(h) ~ F = ~ P(h)F(t, q) = lP(q)F(t, g)] 

= pw «, 3 ) + § q PH , §) + ••• 

+ -P(S) fqk F(t, i) 

= 0, (k< s) (72) 

term for term because of (70). 

Of course, any linear combinations of special solutions of this 
form are also solutions. Every polynomial has n roots, when re- 
peated roots are counted according to their multiplicity; and every 
distinct root yields as many special solutions of the form given in 
(71) as the multiplicity of the root. It follows that we have 

r 

D Mi = n 
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special solutions, not yet shown to be all linearly independent. 
We take the most general linear combination of these 

y = C 10 F(t, 21) + C„ §- q Fit, 2O + • • • + ^ F(*, 21) + 

S ^ 

C 2 oF(t,q 2 ) + C n ^ F(t, q 2 ) -1 h C 2 , mr . 1 F(t, g 2 ) + 


CroF(t , 2r) d~ 6Vl ^ F(b gr) "T ‘ " 1 4" C T ,m r - 1 1 fZr) 

- E E c k.ih F (t,qk), (73) 

t=i i=o aq 

where ( 21 , g 2 , • • • , g r ) are the distinct roots of P with respective 
multiplicities (»i, w 2 , • • • , w r ). This linear combination is a solu- 
tion of the reduced equation. It remains to determine the n co- 
efficients (C 10 , • • • , C T , m - 1 ) so as to satisfy the n arbitrary initial 
conditions in (56). Thus 

t m k —l 

h l E E C kj Fit, q k ) = &«, (* = 0, 1, ■ • * - 1) (74) 

*=1 j=o »q 

or because of (66) and (69) 

E E 1 C w QzPF(/o, 2 *)] = h. d = 0, 1 , • • • , » - 1) (75) 

t=l j = 0 (72 

From (64), F(/ 0 , g) is unity regardless of q, so we have 

r 7»i.— 1 

E E C w (2fc*) 0) = fc, (* = 0, 1, • • •, n - 1) (76) 

k=l 2=0 


where 
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When all of the roots are distinct, the determinant of this 
system takes the familiar Vandermonde form 


2i 

Si 2 


lei 


re -1 


1 

2* 


q 2" 


1 

2» 

e« 2 


2n ; 


n-l 


= II ( 2i 

i>j 


2 /). 


(78) 


which does not vanish. Consequently the « equations can be 
solved to yield the » C’s so as to satisfy the n initial conditions. 
The proof of the identity (78) follows from the fact that the de- 
terminant is a polynomial of the (n — l)th degree in terms of any 
gi, and vanishes for any q, equal to g». This plus the essential 
symmetry in terms of the q’s leads to the right-hand side of (78). 
An alternative proof would use induction. 

In the general case of repeated roots, the determinant becomes 


v= 


it 


0 

1 

2gi 


1 




)<'> 




gl »-i (re — l) g i ” -2 ••• ( 9 i n ~ 1 ) t “>- 1> q-f- 




(£ 2 n-I)(m 2 ~l) 


1 

g.r 


Qr l 


(5/)&» r-U 


(9 r "~ l )( m r~V 


(79) 


no two q’s being equal. This generalized Vandermonde determi- 
nant can be shown not to vanish. 10 This may be proved by induc- 
tion, but for the present purpose we may omit the proof of this 
fact, and simply note its consequence : all of our constants can be 
uniquely determined. 

Here I should like to point out that the inversion of a general- 
ized Vandermonde matrix can be done once and for all, and the 
result is applicable against varying initial conditions. The exact 
form of the inversion is expedited by synthetic division and is 
closely related to contour integrations and the expansion of a poly- 

10 P. A. Samuelson, “A Simple Method of Interpolation,” Proceedings of the National 
Academy of Sciences, XXIX (1943), 397-401. Also F. R. Moulton, Differential Equa- 
tions (New York, 1930), chap. xv. 
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nomial in partial fractions, so that the classical and operational 
methods meet. 11 

26. I turn now to the problem of determining the unreduced 
special solutions with vanishing initial conditions. In the proof 
of our fundamental existence theorem we have seen one way that 
this can be built up out of the basic solution of the first order 
equation Fit, q, 0 ;/)*. Once the latter is tabulated or catalogued 
for a great variety of the /’ s that arise in practice, the remaining 
work is simply that of calculating 

F(t, qi, 0 ‘,f)h = Z\, 

F{t, q h q 2 , 0;f) h = F{t , q 2 , 0; Z^) h = Z 2 , 

Fit, Su qi, Ss, 0 ;f)h = F(t, q it 0;Z 2 ) h = Z 3 

( 80 ) 


Fit, q 1 , q 2 , * , q-n, 0 \f)h Fit, q n , Oj Z n — 1 )& Z n . 


The last is the required solution. 

In this section an alternative procedure is described whereby 
our special solution with vanishing initial conditions is built up out 
of the first order solution, Fit, q, Q;f)h- First, let us recall the 
algebraic fact that any rational expression of the form 


1 = 1 

rr ix - qir - 


T 


GC Mi = ri) 
1 


( 81 ) 


can be split up into partial fractions of the form 

1 A 11 , A 12 , , Auni 


P(x) 


ix 

+ 


2i) 

A 21 




(x - q 2 ) 


(x 

+ 


Si ) 2 
•• + 


+ 


+ 


(x - q 1 ) 


7»1 


in 


(x - q r ) 


+ ••••+• 


A r 


(x — q r Y 


(82) 


In the usual case where all the roots are distinct, the coefficients 
are simply 1 /P'iqi), respectively. When we have a repeated root, 
the coefficients are given by 


1 


imi - j ) 


r d m *~ s (x - gi) m o 
!L dx^ P(x) W 


u P. A. Samuelson, “Efficient Computation of the Latent Vectors of a Matrix,’ 1 
Proceedings of the National Academy of Sciences , XXIX (1943), 393-397. 
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Numerically these can. be evaluated fairly quickly by synthetic 
division. 

The following important identity is an immediate consequence 


of (82). 


7 mi 

1 =11 An 

2=1 j = 1 


P(x) 

(x — qi) j 


(84) 


Let us return now to the problem of finding a solution to the 
unreduced equation with vanishing initial conditions. In the first 
order case, the expression 



(85) 


has as yet no meaning. However, it looks like the solution to our 
equation, or would if operator expressions could be treated like any 
algebraic quantities. For the first time we are in a position to give 
a meaning to negative powers of h or rational expressions in h. We 
shall define the above expression to be 

= F(t,q,0;f) h . (86) 

Immediately [1 /P(h)~\f(t) suggests itself as the solution to the nth 
order equation. But how is it to be interpreted? The use of 
partial fractions comes to mind, and so we may try for our solution 
in the case that all roots are simple 

y = AuF(t, gi, 0’,J)h + AnF(t, g 2 , 0 ;f)h + • * • 

+ A n \F(t, q n , 0;f)h, (87) 

or in the general case 

r mi 

y=IE AijFj(t, gi, 0 ;f)h, (88) 

i—1 j—1 

where we define 

Fi(t, S* 0;f)h = F(t, q it 0 ;/)», (h - q i )F 1 = / 

Ht, S i, 0;f)h = F(t, q it 0; FJ it ( h - = / 

(89) 

F a (t, qi, 0 ;f) h = F(t, q u 0; F a - i) h . ( h — qi) a F 3 == / 

Clearly (88) satisfies the vanishing initial conditions of the 
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problem. After substitution into the wth order equation we have 

my - E 2 !<■ 0;/)» 


r 

= EIA 

2 = 1 / = 1 

= 1/(0 


f PW 1 

L(* -?<) y J 


/ 


(90) 


by virtue of the identity (84). 

Note that our discussion of negative operators was purely 
heuristic and not germane to our proof. The expression in brackets 
does not involve negative operators although notationally it appears 
to do so. 

This completes our second construction of the unreduced equa- 
tion with vanishing initial conditions. 

27. The elementary mathematical literature on differential 
equations abounds with trick methods for arriving at particular 
solutions of the unreduced equation for special f’s (e.g., poly- 
nomials, exponentials, trigonometric functions, etc.). At this 
point it is only necessary to discuss two such methods. 

The first is a formal series expansion, of even greater importance 
in the field of difference equations than differential equations where 
convergence is not always certain. Let us write the formal 
expression 

1 1 

■37 7 = j j ; ~ — j(3o -J- ftx -f- ftx 2 -f- * ■ * . 

P(x) a 0 + aiX + * • • + <x n x n 

(a 0 5* 0) (91) 


Multiply through by P(x) and equate coefficients of like terms to 
get equations determining the infinite sequence of fts in terms of 
the a’s. 




(onffo) 

ao ’ 

(a igi + O2§o) 


( 92 ) 


ft = 


(aift-l + 0*2 ft- a + • • • 4- OLnfi 0 ) 


ao 
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In practice there are a variety of ways of determining the (3 se- 
quence from the a’s (algebraic division, difference equations, 
Taylor’s expansion of 1 /P(x), determinants, etc.). 

Formally, a particular solution of 

P{h)y = / 

is given by the infinite series 

y-m } ~ (93> 

where 1/P is now defined differently from (86) and (87). Whether 
there is any meaning to the right-hand side, that is, whether the 
series is defined and convergent or not, depends upon the h, the /, 
and the a’s. Thus, 

( ai D 4-1 )y=f (94) 

has a finite convergent solution for / equal to t s of the form 

y = 1/ - «i Df 4- ai 2 D 2 f - ai 3 D 3 f + •••+(- ai yk*f 
= i 3 - ai 2 f* + a^At -a/440-04 "-. ^ ] 

While for / equal to 4', we have a formal series 


y = X) (— a-i) l D l e u 
o 


= £ (- «iX) 1 ', (96) 

o 

which converges for |aiX| less than one, but seems to diverge 
otherwise. However, if we use the “formal” identity 


£ (- a{ky = 


o 


1 

1 + «i\ ’ 


(97) 


we get the correct answer, 4 ! (1 4* ax\)“\ although not always by 
a legitimate path. 

Let us write a difference equation in the form 

«• yit) + «1 y(t - 1) 4- • • • 4- oc n y(t -n)= P(E~ l )y(t) = fit), 

P°y(0) « y(0) = 0, 

Er*y{ 0 ) = 0 , 


( 98 ) 
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Then 

y(t) = P(h=Tj m = ( 99 ) 

This will always terminate after t terms, and will satisfy the equation 
and initial conditions. 

Exercises: 

1. Show that fit is a solution of the equation 

P(E~ l )y — a 0 y(t) + ai y(t — !) + *•* + a n y(t — n) = 0, 

y(0) = i, 

y(- l) = o, 


3 '( — n + 1) = 0. 

2. If P(x) has n simple roots, gr 1 , and R(x) = a Q ~~h n P(x~' 1 ) f show that 

1 


3 ‘ ? R'bi) 


(it) 




3. If a t equals a l /t\ y what is fit ? (Hint: Consider series expansion of e a 
and \/e a .) 

4. Write fit in determinantal form. 

5. Write out the first three fi’s in terms of ot s. 


28. The second artifice for arriving at a particular solution 
comes into use when f(t) is itself the solution of a reduced first order 
equation. That is 


where 


P(h)y = / 

hf = qf. (100) 


We try a solution in the form 

y = cJ> 


( 101 ) 


where c is a constant. We have after substitution and cancellation 

of/ 



cP(q) = 1. 

is not a root of P, this has a solution, 

1 



( 102 ) 

( 103 ) 
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In the important electrical engineering studies of frequency re- 
sponse, c is the transfer function; for q a pure imaginary, it is an 
impedance. 

If q is a root of P, we have the phenomenon of resonance, and 
we must try for our solution 


C d m f(t) 

dq m 


(104) 


where m + 1 is the multiplicity of the root q. After substitution 
and cancellation we find 

1 


c = 


P m {q) 


(105) 


If / is composed of two or more components, /i,/ 2 , • • • , which 
are each solutions of the first order reduced equation with q equal 
to $i, g 2 , • • • , by superposition we find a special solution of the 
unreduced equation to be 


;/i + + 


P{qr) Jl ^ P{qf)' 


(106) 


If any of the q’s are roots of P, the modification given in (104) will 
of course be in order. 

Important examples are provided by periodic driving forces in 
the field of differential or difference equations. Any well-behaved 
periodic function can be written as a Fourier series, of which each 
term satisfies a first order differential (difference) equation with 
pure imaginary (complex root of unit modulus) coefficient. Unless 
perfect resonance occurs, as it cannot in a damped system, our 
solution can be built up term by term of the same harmonic ele- 
ments but with coefficients differing by the factor, 1/P(<Z;). The 
resulting solution (if it converges) will be a periodic function of 
time of the same period as the driving force. 

Finally, if our f(i) can be written as a general integral 


fit) = f F(t, q)dW(q), 


( 107 ) 


where 


hF(t, q) - qF(t, q), 
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and c represents some contour in the complex plane, then we try 
the formal solution 

y = f'F(t,q)j^dW{q). (108) 

In cases where P(q) vanishes, this may be an improper integral 
whose limit nevertheless often provides a satisfactory solution. 

The mathematician will recognize that in the field of differential 
equations, all the fs that are likely to arise in practice can be 
written in the form (107), by setting q equal to ia, and F equal to 
e iat , and using Fourier’s integral theorem. 

29. The artifice of the Laplace transform will now be general- 
ized to a wide class of simple operators other than D. The Laplace 
transform is defined as 

J f*co 

1 e~ qa y(a)da, (109) 

0 

which will be convergent for a wide class of y’s, for the real part 
of q less than some constant. 

The fundamental property of the Laplace transform which 
makes it a useful artifice in differential equations is as follows : 

£(q;Dy) = q£(q;y) - y(0), 

£(q;D 2 y) = q 2 £(q;y) ~ qy(0) - Dy (0), 


(HO) 


£(q;D n y ) = q£(q; D n ~ l y) — D n y(0) 

= q n £(q;y) — q n ~ l y{ 0) — q n ~ 2 Dy{ 0) — • • • 

— qD^yifi) — D n y{ 0). 

For then taking the Laplace transform of both sides of 

P(D)y = 'Za i D”- i y= /(<), 

0 

D h y{ 0)=b k , (£ = 0,1, •••,«- 1) 

we get 

P(q)£(q; y) = £(q;f) + bo(a 0 q n ~ 1 + aig" -2 -1 F a n - 1 ) 

+ bi(doq n ~ 2 +■•••+ d n —i) + • • * 

-j- b n -i{doq + di) +• b n -ia<i. (112) 


(HI) 



4x6 


FOUNDATIONS OF ECONOMIC ANALYSIS 


This is an expression in q and involves no operators. Solving 
for the expression involving the unknown function, we have 


p ,„ . x _ ^(£) , £ (g:/) _ ffn) 

<c(g,y) - + p( g ) 7( -g)’ 


(113) 


where R is a polynomial involving the initial conditions, b k , and 
is less than n in degree. Because / is known, the Laplace trans- 
form of our unknown function is determined. Under general 
mathematical conditions we can prove that our solution y is “essen- 
tially” unique once its Laplace transform is determined. One 
method of inversion is given by contour integration 


Y(q)e qt dq, 


(114) 


2 TriJ c _ ix 

where the contour is to the right of all poles of F. 

In any case, once Y(q) is determined, the determination of y(t) 
is a detail which need not concern us. Extensive tables are avail- 
able to aid in inverting Laplace transforms involving a wide class 
of / s. 

To extend the theory we need only seek a junctional appropriate 
to any admissible h , with the fundamental property 

£(g; hy) h = q£(q;y)h - hy(to). (115) 

The following expression formally answers our problem 

F(t, q, 0 \f)h 


= 

where q and / are such that 

lim 


F(t, q, 1 ; 0) A ’ 


m 


0. 


(116) 


(117) 


i“oo F(t, q, 1;0)* 

The proof that this is our generalized Laplace transform is given 
as follows : 

(U8) 


By definition 
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and each of these systems has unique solutions. Now it can be 
verified that 

F{t, q, 0; hf) k s hF(t, q, 0 ;/)* - q, 1 ; 0)„, (120) 

because 

hF(t 0 , q, 0 ;j) h - fit 0 )F(t 0 , q, 1 ; 0)* = f(h) - /(*„) = 0, (121) 

and 


(h - q)lhF(t, q, 0 ;f) h - f(h)F{t, q, 1; 0) A ] 

= hl(h ~ q)F(t, q, 0;/) A ] - - q)F(t, q, 1; 0) A 

= h£f] - 0. (122) 

Therefore, 




m] 


m 


- i: m n EMi £1 1 

jt — >00 F(t, q, 1 ; 0) h t — >00 Fit, q, 1; 0)* 

= 2 ^( 2 ; /)* + 0 - fih) 


■ fit 0 ) 


(123) 


for q and / limited as in our hypotheses. Q. E. D. 

Thus, the solution to the general nth order problem with pre- 
scribed boundary conditions is given by the inversion of 


or formally 


£(g; 30 a 

y{t) = £- 



£jgjj% 

m 

+ £ _1 [ 


= Y(q) h , 

MqjJhl 

Pi 2) J' 


(124) 

(125) 


Moreover, even where there is no convergence of £, "<£” can be 
“inverted” as the mathematically inclined reader may work out 
for himself. So long as fit) leads to a unique solution of the first 
order operator equation, its behavior at infinity is irrelevant. 

Exercises: 

1. Show that <£({?; j)e — 2D = Y (s)s, where to — 0 . 

0 2 

2 . What is £(g; y)cf What is £(g; y)<z>? 
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3 . Make and verify a table of transforms for E as follows: 


y(t) 


a 1 = F(t, a, 1 ; 0 )h 

1 

q — a 

ta *- 1 

(ihr 

t{t - 1) ••• (i - n + 1 )a‘~ n 

/ 1 y+i 

n\ 

\2 ~ a) 

F(t , a, OjyOA 

Y 1 (q) h 
q — a 

1 

L yS)yi(t - l - ») 

0 

fi (ff)jvF,( S )jr 

3 n y 0 , a) 

d"F(g, a) a 

da" 

da n 


? 


4 . Trace the relationship between the “classical” and the generalized 
Laplace transform method. 

Many Variable Systems 

30. Let us return now to linear systems of many variables 
(yu J 2 , • • ym). For the dynamic system to be determinate, we 
must have at least m operator equations. These can, of course, be 
of high order, say n or less, and can be written as 

a^nh n yi + ^o,i 2 ^ 7 l y 2 + • • * + #i,n fe n-1 yi + • • • 

" 4 “ ^n,lmym. “ 

a 0 ,2ih n yi + #o,22^ w 3 ; 2 + * - * + #i,2ife n, ~ i yi + * * • 

+ G'n % 2my in :=: J^2 00 j 

( 126 ) 

ao f mih n yi + # o , m 2 ^ 3 ; 2 +■*** + ai t mih n ~ 1 yi + * * * 

+ &n,mmym 555 fm(f) j 

or as 

P n{h)yi + Pi2{h)y 2 + * * * + Pim(h)y m = 
p2i(h)yi + P22(h)y% +*••• + Pim{h)y m = / 2 OO, 


PmlWyi + Pm2(h)y 2 + * • • + Pmm(h)y m = 
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where P a (h) are polynomials in h of the nth degree or less; or in 
matrix notation as 

a Q h n y + aih n ~ l y + • • • a n -ihy + a n y - jit) 

and 

Py=f, 

where (ho, • • • , a n ) are m 2 matrices, y and f column matrices of m 
elements. Of course, many of the elements in the a’s may consist 
of zeros. 

The maximum order of our system will be nm ; but its actual 
order may well fall short of this. By the actual order of the system 
we mean the maximum number of initial conditions which can be 
arbitrarily imposed on the system. 12 

As an exercise in matrix notation the reader may work out the 
demonstration that by redefinition of variables 13 the system can 
be written as a first order one, 

AohY + A.Y^F, (127) 

where 


I 

0 

... 0 

0 

0 

1 

... 0 

0 

0 

0 

... 0 

I 

.0 

0 

... 0 

a o 


0 

-i 

... 0 

0 


0 

0 

0 

... 0 

0 


0 


. 

... 

• 

, F = 

• 

. 

' . 

... 

. 


. 

0 

G n 

0 

1 

... 0 

* * • &2 

-I 
a i. 




“It can be shown that the order of the system, s, is equal to the degree in g of 
the determinant 

A(q) ~ | a 0 q n +' aiq n ~ l 4“ ** * + a»|. 

See R. A. Frazer, W. J. Duncan, and A, R. Collar, Elementary Matrices and Some 
Applications to Dynamics and Differential Equations (Cambridge, 1938) for references. 
See par. 5.4. 

13 Frazer, Duncan, and Collar, Elementary Matrices i par. 5.5. 
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Although our system has been converted to a first order system, 
it is not yet in “normal” form with h Y expressed explicitly in terms 
of Y and functions of time, 

hY = AY+f. (128) 

If A o is non-singular, that is, if its determinant does not vanish, 
we can premultiply both sides of (127) by A f) ~ 1 and immediately 
convert to normal form, 

hY = — (.<4<rh4i)F -f- (Ao-'F). (129) 

Similarly in (126) if a 0 is non-singular, our operators of highest 
order can be solved for in terms of those of lower order by the act 
of premultiplication with a 0 _1 . 

If a 0 is singular, our system is not well-specified. One of three 
situations must then prevail : (1) there exist purely statical, algebraic 
relationships between some of the variables. By means of these 
relations one or more variables can be eliminated, and the leading 
matrix of the resulting smaller system will then be non-singular; 
(2) the relationships are not all independent so that a unique solu- 
tion does not exist. Arbitrary functions of time will enter into 
the solution. (Example: A system with two identical equations, 
permitting one variable to be chosen as an arbitrary function of 
time) ; (3) the system is inconsistent. 

31. It is up to the economist to present a well -specified dy- 
namical system. In what follows we shall assume that this task 
has been adequately performed so that whenever desirable our 
system can be written in the normal first order form of order s. 

hy = Ay + f, (130) 

with initial conditions 

y(h) — b. 

As in previous sections we shall find our solution in two steps: 
first, the solution of the reduced or homogeneous system 

hy — Ay, (131) 

y{to) = h ; 

and then the solution of the unreduced system 

hy = Ay + f, 
y(t 0 ) = 0. 


( 132 ) 


DIFFERENCE EQUATIONS 42 1 

The sum of these two solutions will be the solution to our general 
system (130). 

32. Earlier we have seen that the solutions of the simple first 
order equation in one variable provided the building blocks for 
the solutions of more complicated cases. We try, therefore, a 
solution of the form 

y\ = CiF(t, q, 1 ; 0 ) h = c^Fit, q), 


(133) 


y» — c s F(t, q, 1; 0)^ — c t F(t, q ), 

where F is as defined in earlier sections, and the e’s and q are un- 
known constants. After substituting into (131), we get by means 
of the fact that hF equals qF, 

qC\F — AiiCiF + A12C2F -j- • • • -f AuC 3 F, 
qc^F = Az\C\F + A22C2F -j- • ■ • + A^c a F, 

(134) 

qc s F — AaiCiF -f- AsiCzF -}-•• • -f- A iS c s F. 

The common factor F may be canceled from both sides of the 
equation, leaving us with 5 algebraic equations which do not involve 
time. If these equations can be satisfied, we shall have a solution 
of the operator equation, although not necessarily the solution 
with appropriate initial conditions. 

Our linear equations involve known ^4’s, undetermined c’s and q. 
Our equations are homogeneous in the c’s and may be written as 

(.4ii — q)ci + -di2C2 + — + AuCs — 0, 

A 21 C 1 + (d 22 - g)c 2 + •• • + Ai S c a — 0, 

(135) 

A a \Ci + AaiCz +•••-+- {A as ~ i)Ca = 0 . 

If all the c’s are zero, we have a trivial solution of the operator 
equation. When can we find a non-trivial solution? From the 
theory of ordinary linear equations (Cramer’s rule, etc.), we know 
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that a system of homogeneous linear equations has a non-trivial 
solution if, and only if, its determinant vanishes; in this case if 


An ~ 2 

A 12 

• ■ • A is 

-421 

4 22 2 

* * * A 2s 


4s2 

* * * A ss 2 


= 7rog S + Trig 5 " 1 + ' • ' + ITs-lg + TTs — 0. (136) 

For a q chosen at random the determinant will not vanish, and 
we shall have no non-trivial solution. But q is at our disposal ; we 
select it to make the determinant vanish. The determinant is a 
polynomial of the 5th degree in q, and is called the characteristic 
equation of the matrix (or secular equation). The roots of the 
characteristic equation must necessarily be 5 in number (counting 
multiplicities). They are called characteristic or latent roots of 
the matrix. If we designate them by (q%, q s ), we can write 

A(a) = (21 - 2)(2* - 2) • • ' (2. “ 2) = 0. (137) 

It is not easy in fact to find the characteristic roots of a matrix. 
From the definition of a determinant, the coefficients of the charac- 
teristic polynomial can be determined as the sums of principal 
minors of a given order (cf. Mathematical Appendix A, p. 374), and 
any of the customary methods for solving a polynomial for its 
roots may then be employed. Even with the best short cuts, 14 the 
computations are tedious. Fortunately, the economist does not 
always require the quantitative values of the roots. 

Let us return to the problem of finding solutions to our reduced 
operator system. If we put a characteristic root, q it into the F in 
(133), and take for our c’ s a non-vanishing solution of (135), we 
shall have a solution which satisfies our reduced system. Note 
that our selection of c’s is not unique ; doubling each will also give 
us a solution. 

A column of c’s satisfying the homogeneous equations is called 
a characteristic vector or latent vector or sometimes a modal column 
or normal component. Some of the characteristic roots may be 


a 


14 Frazer, Duncan, and Collar, Elementary Matrices , chaps, iv and v: 
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complex; the characteristic vectors will then be complex. Of 
course, conjugate complex magnitudes can always be combined so 
as to give only real quantities in the final answer. 

If a root qi is simple, then A '(g f ) is not equal to zero, and the 
rank of [A — qJ2 is (s — 1). Consequently, the latent vector 
corresponding to qi is unique except for scale factor. We may dis- 
pose of this ambiguity by some normalization convention. 

If all roots are distinct, the general solution can be simply- 
written. Where roots are repeated, certain complications occur. 
However, these are not intrinsic and rarely concern the economist. 
Later, I show how these difficulties can be avoided. But for true 
mathematical insight the reader must be referred to the theory of 
elementary factors and divisors. 15 

If the ^ roots are simple, and the s latent vectors have been 
normalized, we have the following particular solutions of the re- 


duced equation (131): 

c U iF(t, qi); 
Cn,iF(t, qi); 

Ci,zF(t, q 2 ) ; 
Ci,iF{t, q 2 )\ 

Ci, s F(t, qf); 

••• Ci, s F(t,q a ); 

(138) 

Cs,iF(t, qi ) ; 

c*,zF{t , q 2 ); 

■ ■ * qf)) 



where F{t, q) is an abbreviation for F(t, q, 1 ; 0) h . Any linear com- 
bination of these is a solution, or 

Ji = K\Cj,\F{t, qf) + KiCj,zF(t, qf) -f- • • • + K s Cj, s F(t, q„). 

0 ' = 1 , •••,*) ( 139 ) 

Note that the c’s and q’s depend only on the matrix and not on 
the initial conditions. But the K’s must be determined by the s 
initial conditions. Thus, 

3'i(0) = Ki c i,i + Ktfi.z + • • • + K S C\, S = bi, 

3 * 2 ( 0 ) = KiC 2 ,i + Kv.Cz, 2 + • • • + K a Ci, a = bi, 

; (140) 

y s (0) = KiC s ,i + KiCt'i + • • • + KsCs.s — b„ 

15 M. Bocher, Introduction to Higher Algebra (New York, 1933). 
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or 

cK = b. 

These equations are sufficient to solve for the K’s in terms of the 
b’s if the matrix c is non-singular. The reader may be referred to 
any mathematical text for the proof that c is non-singular, and that 

c-'Ac = [ 2i 5 t7 ]. 16 (141) 

In the case of distinct roots, the solution to the reduced system 
(131) can be written in compact form as 17 

y = c[F(t, q % )bif]c~ l b. (142) 

33. If b is the identity matrix, we have a fundamental set of 
solutions. This may be written formally as Fit, A), either as an 
extension of Sylvester’s Law of Latency mentioned in Mathe- 
matical Appendix A, footnote 13, or from the definition 

F(t, A) = Zm(t)A { , (143) 

o 

where the m' s are defined by the formal identity 

Fit, S ) = E mi(t)q\ (144) 

o 

Because of the property 

hF = qF, (145) 

h E w i (0s i = E miit)q i+ \ (146) 

0 0 

and we have the formal identities 

hmiit) = mi-iit), ii = 1, • • •), (147) 

and hence 

hFit, A) = qFit, A). (148) 

Also 

Fih, A) E 7. (149) 

Formally, these above relations hold even if there are equal 
roots, The formal solution given in this section is sometimes a 

w In Mathematical Appendix A this fact is used in connection with symmetric 
matrices, for which c is orthogonal; i.e., c~ l ~ c\ 

17 Frazer, Duncan, and Collar, Elementary Matrices , chap. v« 
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very convenient one. For the differential equation system 

By = Ay, (ISO) 

F(t, A, 1 ; 0)z> takes the form 

F d = eA* = I + A ji + A* •••. (151) 

For the system 

Ey = Ay, (152) 

we have 

F{t,A, 1;0), = AK (153) 

However, for stability investigations these solutions are not very 

convenient. 

34. If instead of the roots all being simple, some are repeated, 
we may write 


Ms) = (si - s) mi (s* - s) mi • • • (2r - q) mr . 


(154) 


where (gi, • • • , g r ) are the r distinct roots with respective multi- 
plicities (mi, • • *, m T ). 

As before, there are special solutions of the form, CiF(t, qj), but 
the c’s are no longer unique except for scale factor; in fact, even if 
the c’s were uniquely defined, there would not be a sufficient num- 
ber of particular solutions to combine to get a general solution for 
s arbitrary initial conditions. 

Earlier, in the case of multiple roots we tried solutions of the 


form, 


9F(t, gi) d 2 F(t, gi) 


d”*-iF(t, gj) 


dg ’ 3g 2 ’ ’ 9g m<-3 

Let R(q) be the ad jugate of the g matrix, 

P(g) = A - gi; 


, and so we shall here. 


(155) 


that is, R(q) is the transpose of the matrix made up of the cofactors 

ofP(g). 

By means of algebraic theory of no interest here, it could be 


can be 


shown 18 that from the columns of the matrix j 

selected vectors which when multiplied by the partial derivatives of 
F will give a complete solution. However, these difficulties can be 

18 Frazer, Duncan, and Collar, Elementary Matrices , par. 5.10. 
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side-stepped by the device of reducing the 5 first order equations in 
5 different variables to single 5th order equations in each variable. 19 
Thus if 

hy = Ay, (156) 

y(to) = b, 

then 

hy(t 0 ) — Ab, 
h 2 y(t 0 ) = A 2 b, 

(157) 

A !-1 y(f 0 ) = A' , ~ 1 b. 

Because of the Cayley-Hamilton Theorem that every matrix satis- 
fies its own characteristic equation or 

A(A) = 0, (158) 

we find from (156) that 

A (k)y — (ir oh s + + • * • + x s )y = A(A)y = 0. (159) 

Hence, each variable, say y,{t) , satisfies an 5th order equation. We 
also know from (157) the values of Qy, •(£<>), hyj(t 0 ), • • • , h a ~ 1 y ] (to)2- 
Hence, we have a well-determined system whose solution can be 
found by the methods of sections 24, 25, and 26. It is clear then 
that our final solutions take the form 


3 >i 00 


= Cj,nF(t % g_i) + Cj t i 2 
+ Cj f 2 iF(l, q 2 ) + 6j\22 


dF(t, gi) 

dq 

bFih gg) 

dq 






qi) 

dq ™'- 1 

q a ) 

dq ™* -1 


(160) 


4* c i,r\F(t, q r ) + Cj,r 2 


9F(t, gr) 

dq 


j_ 1 _ d m ’— 1 Fjt, g r ) 

‘ ‘ C i,rm r ~l Qqm r - 1 ’ 

0 = 1 , •• -,s) 


19 The method of attack suggested here is not only theoretically simple but is also 
computationally optimal. See P. A. Samuelson, “Efficient Computation of the Latent 
Vectors of a Matrix,” 'Proceedings of the National Academy of Sciences, XXIX (1943), 
393-397. 
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where the s~ c’s are not all independent. All but 5 are determinable 
from the matrix above. The remaining .? depend on the 5 initial 
conditions. 

Thus, terms of the form dF k / dq k may appear if there are multiple 
roots. But they need not appear even if there are multiple roots. 
That is, whatever the initial condition b, the coefficients Cj, ik may 
be identically zero for (1 < & < mi). For example, in the matrix 


A = l- 35 tJ ], 


(161) 


— 3 is a repeated root of multiplicity 5. F{t, — 3) appears in the 
final solutions, but not dF(t, — 3)/dq, or any higher derivatives. 
The reader must again be referred to the theory of elementary 
divisors for insight into this possibility. 

As far as stability is concerned, except for borderline cases 
(which we have excluded in our strong definition of stability), it is 
clear that 

lim F{t, q) = 0 

implies 


lim 

t—* 00 


dF 

dq 


= lim lim 

t-~+ 00 Aq -+0 


F(f, q + Ag) - F(t, g) 

Aq 


lim 0 = 0. (162) 

A 


Therefore, it does not matter whether or not the partial derivatives 
of F are suppressed. 20 

35. We have now given the complete solution of the reduced 
equation. By the same method as the last section we may indicate 
the solution of the unreduced part. 

hy = Ay + /, (163) 

y(to) = 0. 

20 A borderline case on the boundary between stability and instability is provided 
by a symmetrical conservative physical system. Here F(t, iq)n — e^ 1 and is neither 

dF 

damped nor undamped. However, — = tie'* 1 is explosive, as is well known from the 

theory of resonance. But the autonomous reduced system y - Ay will not have ex- 
plosive solutions when there are multiple roots. Even Lagrange overlooked the possi- 
bility that the coefficients of the terms tie** 1 would vanish. In the nineteenth century 
it was shown that the elementary divisors of a symmetric negative definite matrix are 
all linear, and that such terms are actually suppressed. Only an external force can cause 
resonance of the above type in a conservative system. 
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We easily see that 

h°y ~ y, 
hy = Ay + /, 

h 2 y = Ahy + hf = A 2 y + Af + hf, 
h*y = A*y + A 2 f + Ahf + h 2 f, 


( 164 ) 


h s-ly = A e-ly _j_ A s-2j J_ A s-S : h f 4 + AJt^f + k*~ 2 f, 

h*y — A s y + A s-1 / + • • • + Ah*~ 2 f + h*~ x f. 

Multiplying the first line by 7r s , the second by t s -i, the last by to, 
and adding, we have 

A (h)y — &{A)y + f + mi hf + • • • (165) 

where the first term on the right-hand side vanishes by the Cayley- 
Hamilton theorem, and where the m’s are linear combinations of 
powers of A. 

Each of the variables (yi, • • • , y m ) must satisfy an unreduced 
5th order equation. The right-hand driving functions seem to , 
depend upon hf, h 2 f as well as /; and yet, we have nowhere made 
the assumption that expressions of the form h k f need even be de- 
fined. If we write out a typical equation, say in y i, the paradox 
will be dispelled. We have 


or 


myi = RiW, 
Rim 


(166) 




m 


Because Ri is of degree (5 — 1) at most, it will be clear that 
(by partial fraction expansion or otherwise) we can avoid all terms 
of the form h k f. Only expressions of the type F(t, q,b\f)h will 
appear. 

We have now only to specify the correct initial conditions. 
Because 

A k y(t 0 ) =0, (k = 0, !,-••) 


M A 
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our initial conditions should be 

hy(t 0 ) = f(t 0 ), 
h 2 y(t 0 ) = Af(t 0 ) + hf(t 0 ), 


(16.7) 


0 ) = + A°~%f(t 0 ) + ■■• 

+ Ah°-*f(to) + 

We now have an unreduced 5th order equation with prescribed 
initial conditions which each variable must satisfy. Our problem 
has therefore been reduced to that of earlier sections, which has 
been completely solved. 

Exercises: 

1. Show that I e A( ' t ~ t *~ w) f(w)dw upon proper definition becomes the solu- 

J to 

tion of ( 132 ), when h is D. 

t- 1 

2. Show that Y, ^ performs the same role for h equal to E, 4 = 0. 

o 

3 . By using the generalized Laplace transform &(q;y)h, defined earlier, 
go from 

ky = Ay + /, 

y(k) = & 

to 

where [T*j(2)/A(2)] equals [_ql — a] -1 and consists of rational functions whose 
numerator is of lower degree than denominator. Show that expressions of the 
form h k f(t) do not enter into the solution in an essential way. 

36. We have already seen in Section 21 of this Appendix that 
it is necessary and sufficient for stability of a differential equation 
system with constant coefficients that the real parts of the roots 
of the characteristic polynomial should all be negative, i.e., the 
roots must all be in the left-hand half of the complex plane. Other- 
wise we should not have the required dampening factors in our 
exponential solutions. 

Fortunately, it is not necessary to solve the characteristic equa- 
tion for its roots to determine whether these stability conditions 
are satisfied. There exist known simple necessary and sufficient 
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conditions on the coefficients of the polynomial which guarantee 
stability. These are known as the Routh-Hurwitz conditions. 21 
Let our general polynomial of the «th degree be written as 

/(X) = aoX n + a\X n ~ l +•••■+• a,n-iX + a n = 0, (168) 

where a o can always be regarded as positive, and will often be taken 
to be unity for convenience. 

If the a coefficients are all real, it is a familiar fact that such 
complex roots as occur will appear in conjugate pairs, ( u + iv) and 

n 

{u — iv). If we write the polynomial as the product, ooll (X — X.) , 

these complex roots will combine to form real quadratic factors. 
It follows, therefore, that every polynomial can be written as the 
product of quadratic factors and linear factors, 

/(X) = a 0 (X + mJ(X + «,)••■ (X + «,) 

(X 2 + biX + d)(X* + b 2 X + a) • • • . (169) 


The linear factors can be grouped into quadratic factors except 
when n is odd; in which case / can be written as a product of 
quadratic factors and at most one linear factor. 

Now the roots of / cannot be “stable,” unless the roots of each 
linear and quadratic factor are stable. We must investigate then 
these simple cases, which can fortunately be treated exhaustively 
in a simple way. The linear factor 


X + m i = 0 


clearly has a Routhian root (one whose real part is negative) only 
if its coefficients are of the same sign. For — mi is its root, and 
nti must hence be positive. 

A quadratic factor of the form 

X 2 + bX + c = 0 
has roots which can be written as 



— b ± V& 2 — 4c 
2 


(170) 


21 E. J. Routh, Stability of Given State of Motion (London, 1877), Adams Prize Essay; 
Dynamics of a System of Rigid Bodies (6th ed.; London, 1930), part II, chap, vi; A. 
Hurwitz, “Ueber die Bedingungen, unter welchen eine Gleichung nur Wurzeln mit 
negativen reellen Theiien besitzt,” Mathematische Annalen, XLVI (1895), 273-284. 
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If b 2 — 4 c is greater than or equal to zero, the roots are real and 
the quadratic expression can be factored into the product of two 
linear expressions, ( X + mf)(X + mi)- It is obviously necessary 
then that the coefficients b equal to (% + mf) and c equal to 
should both be positive; for the sums and products of positive 
numbers must be positive. 

We have so far proved only the necessity for the coefficients 
to be positive in the real root case. Are these conditions sufficient 
in the real root case? And what about the complex case? Leav- 
ing the latter aside, we easily deduce that positive coefficients are 
also sufficient to give negative roots. The first term in the numer- 
ator of (170) is clearly negative. If we choose the negative sign 
before the radical, we have one root which is clearly negative. But 
the other root will be positive unless the second term outweighs the 
first, i.e., depending upon whether b 2 is less than b 2 — 4 c. By 
hypothesis, c is positive, and it follows that the second term cannot 
outweigh the first. Thus, positive coefficients are both necessary and 
sufficient for negative roots of a quadratic, when the roots are as- 
sumed to be real. 

The complex case is even simpler. If we add the two roots, 
(u + iv) and (u — iv), the complex part cancels out, and — b is 
twice the sum of the real parts. If these are negative, then b must 
be positive. In any case c is positive, or the roots would not be 
complex, i.e., b 2 — 4c is less than zero only if c is greater than zero. 
Therefore, our coefficients must be positive if the roots are Routh- 
ian. This proves necessity. If both b and c are positive and the 
roots are complex, it follows immediately from (170) that u must 
be negative. This proves sufficiency. 

Thus, from our treatment of the real and complex cases, we can 
enunciate the simple theorem : a second degree polynomial has roots 
whose real parts are both negative if, and only if, its coefficients are 
both positive. 

I mm e dia tely we can deduce certain necessary (but not suffi- 
cient) conditions which must hold in the general case if the roots 
are all to be Routhian. Each linear and quadratic factor must 
have positive coefficients, and, therefore, all the coefficients of our 
polynomial must be positive. This necessary condition would also 
be s uffi cient if we dealt only with real roots by virtue of Descartes 
Rule of Signs discussed in every book on algebra. 
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But there is no reason in the general case to exclude complex 
roots. 22 What if some of the b's are zero or negative, but in com- 
bining to form the a coefficients the positive coefficients alway 
outweigh the negative ones? Then our a coefficients’ all being 
positive would not be sufficient. A simple example establishes 
this possibility. 

(X + 1)(X 2 - .001X + 1) = X s 

+ .999X 2 + .999X + 1 = 0. (171) 

Note that all the a’s are positive even though the complex roots 
have real parts which are positive (= + .0005). 

Now that we are warned, a number of possible modes of attack 
are open to us. We may tackle the third and fourth degree cases 
by themselves, knowing that but rarely will higher degree equations 
be involved. Or we may deduce further, stricter necessary condi- 
tions, hoping that when we have found enough of them they will 
be sufficient. 

Thus, in the third degree case we have 

X 3 + a x X 2 + a 2 X + az = (X + w)(X 2 + bX + c) 

= X 3 + (m + b)X 2 + (c + bm)X + me. (172) 

We form the product — never mind why — 

0^2 — a 0 a 3 = (m + b)(c + bm ) — mc{ 1) 

= b(c + mb + m 2 ) + 0 . (173) 

If the roots are all Routhian, c,b, m are all positive, from which it 
follows our expression aia 2 — a Q a 3 is positive. 

We now have a new necessary condition, or four in all, 

Oi > 0, 

c 2 > 0, (174) 

a s > 0, 

CL\Cbi — CLqQ'Z 0 . 

Are they sufficient? If we seek an example which satisfies them 
and yet yields non-Routhian roots, we shall be disappointed. Or 
so the author’s experience suggests. But how can we be sure that 

22 If the polynomial in question is the characteristic equation of a symmetric matrix, 
the roots must all be real, and the simpler analysis can be applied. 
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a more clever, a more zealous investigation will not produce such 
an example? Only by a sufficiency proof. 

As a matter of fact, these four conditions will be later shown 
to be both necessary and sufficient. But there is still one hitch. 
They are not all independent. The reader may verify that the 
condition a 2 > 0 can be deduced from the other three. (Can any 
one be deduced from the other three? Which can, and which 
cannot?) It was the great achievement of Routh to have derived 
an easily calculable set of independent necessary conditions. 

As an introduction to what may be called Routh ’s test deter- 
minants, consider the 3X3 matrix 


~Oi a% 0 ' 

do Sj 0 

.0 ai fl 8 . 


(175) 


Its three naturally ordered principal minors, Aj, A 2 , A 3 , are, re- 
spectively, ai, a 2 ai — a 3 a 0 , a 3 (ai0 2 — a s a a ), and if they are all posi- 
tive, then the necessary and sufficient conditions for a cubic to be 
Routhian are realized. This suggests a general rule for a poly- 
nomial of the nth. degree. List its odd coefficients in a row, 
treating all coefficients with subscripts greater than n as zero: 
0 iaa 0 6 07 • • • . Let each of these elements head a column, whose 
subscripts decrease one at each step. Follow the convention that 
any negative subscript implies that the coefficient is zero. Thus 
we have a square array of numbers, of which only the first n rows 
and columns need concern us. 



d% 

. 

&5 

o 7 

do 

d2 

<H 

do 

0 

(h 

dz 

do 

0 

do 

d2 

&4 


(oo > 0) (176) 


The reader should write out the matrix in full for an 8th and 9th 
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degree polynomial, noting the difference between an odd and 
even n. 

Routh’s theorem states that it is necessary and sufficient in order 
for the real parts of all roots to be negative that the principal minors of 
this matrix all be positive; i.e., 

Ai — Oi 0, 

A o — CiOo — OqOs A 0, 

(177) 

A„_i > 0, 

A n “ a n &n — i 0. 

The condition that all the a’s are positive is contained within these 
conditions; but not vice versa (except if all roots are real). 

The rigorous proof of this theorem cannot be given here. The 
reader must be referred to Routh’s Adams Prize Essay. A brief 
sketch of the reasoning may, however, be indicated. Let X be 
any point in the complex plane, and/(X) be a mapping into another 
complex plane. Consider any closed contour C in the X plane. 
The number of roots of f{X) within this contour can be determined 
by an important theorem of Cauchy, 23 which is essentially topo- 
logical in nature. Traverse the contour C in the positive sense 
(counter-clockwise) and consider the mapped contour of f(X). 
The number of roots in C equals the number of times that f(X) 
loops around the origin ; or if we write f{X) as P + iQ, the number 
of times that the ratio P/Q passes through zero from plus to minus 
in excess of the number of times that the ratio passes through zero 
from minus to plus. 

Routh takes as his contour C the right half complex plane, or 
more rigorously a limitlessly large semicircle to the right of the 
origin and the imaginary axis. The changes in sign of P/Q on the 
semicircle are easily evaluated. Hence, if there are to be no roots 
in the right half plane, the number of changes in sign of P/Q on 
the imaginary axis are uniquely determined; and the polynomials 
fi(X) and fi(X) must have all real roots which separate each 
other, where 

± f(iX) - MX) + iMX). (178) 

n See any textbook on complex variables. 


DIFFERENCE EQUATIONS 435 

By classical Sturmian theory this involves taking the greatest 
common measure of fi(X) and ft(X) to get new functions / 3 (X) , 
fi(X), • • •. Finally, the conditions on these subsidiary functions 
can be shown to be equivalent to the Routh-Hurwitz test deter- 
minants given in (177). 

Numerically, these determinants can be easily evaluated by 
various Gauss- Doolittle methods. As an exercise the reader may 
verify that the conditions for a fourth degree equation are as 
follows : 

®i > 0, 

01O2 - a 0^3 > 0, 

(tiaras — a 0 ®3 2 — >0, ^ ' 

<z 4 > 0. 

It is worth noting that as we move continuously from a stable 
set of coefficients to an unstable set that a„ or A„_i first change sign; 
also that a reverse numbering of the a ’ s must satisfy the stability 
conditions. 24 

37. We have seen that for a difference equation system to be 
stable it is not necessary that the real parts of the roots of the 
characteristic equation be negative. Rather the roots must be 
less than unity in absolute value. Instead of all lying in the left 
half of the complex plane, they must all lie in the unit circle. 

For first and second degree equations it is not hard to work out 
necessary and sufficient conditions by direct algebraic means. For 
higher degree equations such methods are tortuous. Fortunately, 
by a simple use of complex transformations we can reduce our 
problem to the Routhian problem already given a complete 
answer. 26 The complex transformation 

X = > Z = Y~\ ( 180 ) 

converts the unit circle into the left half of the complex plane. 

u In P. A. Samuelson, “Conditions that the Roots of a Polynomial be less than 
Unity in Absolute Value,” Annals of Mathematical Statistics , XII (1941), 360-364, a 
numerical example is worked out and some further facts are noted. If a number of 
A’s vanish, then a further test must be made. 

26 P. A. Samuelson, “Conditions that the Roots of a Polynomial be less than Unity 
in Absolute Value,” Annals of Mathematical Statistics , XII (1941), 360-364, 
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a»(Z + l) n + o t (Z + 1 Y~\Z - 1 ) 
(Z - 1). 


+ 


+ 


a,-i(Z + 1)(Z - l) n_1 + a n (Z - 1)™ 
(Z - 1)« 


( 181 ) 


If the polynomial f(X ) is to have no roots in the unit circle, 
then the numerator of the right-hand side, regarded as a poly- 
nomial in Z, must satisfy the Routhian conditions. If we expand 
out this polynomial in the first degree case, we find as necessary 
and sufficient conditions that the root of X + a\ = 0 be less than 
unity in absolute value; 


1 + > 0, 

1 — ox > 0, 


or 


|d|< 0 . 


(182) 


For the second degree polynomial, X 2 + a t X + o 2 = 0, the 
reader can show that the stability region of the (a it a 2 ) plane is a 
triangle defined by 

1 4 ~ O'l 4 " 02 > 0 , 

1 - 02 > 0, (183) 

1 — <ii 4" > 0, 


It is to be noted that the conditions are always one greater in 
number than the degree of the equation. This is true in the 
general case of the nth degree equation. 


f(X) = X n 4- oqX*- 1 4- ■ • • 4- a n - X X + a n = 0. (184) 

We form the equation in Z, 

a 0 Z n 4- OiZ n_l 4- • • • 4- a, n -\Z 4- a n = £ o,(Z - 1) ; (Z 4- l) n_i 

i=0 

. . (185) 

= Eflizf!) z«(- iy ( n 7 4 ) z»-w 

i=o fc==o i=o \ * ) \ J J 


where 


(/) " t!(s -t)V s = t; 1 = 0 


0, 

0. 


s < t 
t < 0 


( 186 ) 



DIFFERENCE EQUATIONS 
Collecting terms we find 

n 

a 0 ^ 53 Q'ty 

i=*0 

n 

di = 53 — 2i), 

i=0 


n n 


dr — 53 0>i 53 

i =0 fc =0 
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(187) 


5„ = 1 — (Ji + 0^2 — ■ • • 4* (“ l) n *~^n — X 4" (— 

In addition to the condition, 

d 0 > 0 , 

we have the Routhian conditions 

Ai > 0, 

A2 0, 

(188) 


A„ >0, 

where these are naturally ordered principal minors of 


di 

dz 

d B • - * ' 

d 0 

d 2 

di 

0 

di 

d% 

0 

do 

di * • * 


m 

« • » . 


The reader may verify that for a cubic equation, 

do = 1 4~ a,\ 4* ®2 4~ O'Z d - 
5x = 3 4" &i — — 3, 

“ 3 ® 1 2 4~ 3 , 

C 3 = 1 dl 4" ®2 — 0 . 


(189) 


(190) 
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38. Formally, our stability criteria are complete. But for 
most theoretical purposes numerical values of the various coeffi- 
cients are not at hand. It would be highly desirable to be able to 
infer from the qualitative properties of our dynamical matrices 
even before their characteristic determinants have been (labori- 
ously) expanded out whether or not they are stable. 

Unfortunately, this is rarely possible. Often, however, certain 
very general sufficient conditions can be indicated to guarantee 
that a differential equation is stable — or as I shall term it Routhian ; 
or to indicate that a difference equation is stable with moduli less 
than unity — or as I shall term it Tinbergenian in honor of Pro- 
fessor Tinbergen’s outstanding work with difference equations in 
economics. 

The following theorems are listed more or less without proof: 

Theorem 1: x = ax is stable, if the matrix a is symmetrical and 
negative definite; i.e., a is a Routhian matrix. (See Mathematical 
Appendix A, page 19, passim for this classical fact.) 

Theorem 2: If a is quasi-definite, so that — -jy — is nega- 
tive definite, then a is Routhian. ^ Hint: ^(X'X) = X'aX 
= X’ (^ ~y~ J X < 0. Therefore, lim x,(f) = 0, and roots must 
be damped. ^ 

Theorem 3: If a' a — I is negative definite, then a is Tinbergenian ; 
i.e., X t+ i — aXt is stable, and roots of | a — XI| = 0 are all, less than 
unity in absolute value. (Hint: X t+ i!Xt+i — X/(a'a — I)X t 
= A(D x t 2 ) < 0 by hypothesis. Therefore, lim X t = 0.) 

t-+GO 

Theorem 4 (Metzler): If a is “Hicksian” as defined in chapter 
VI, with all its off-diagonal elements positive, then it is necessarily 
Tinbergenian. 

» 

Theorem 5 (Metzler) : If an = 0 and 53 a.j < 1, then (a — I) is 

j=i 

Hicksian, and hence (by Theorem 4) is Tinbergenian. 

Theorem 6 (Mosak) : If a is Hicksian, and if the off-diagonal 
elements of a are all positive, then all of the elements of a -1 are 
negative. 
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Theorem 7 (Oppenheim- Y ntema) : If all the symbols are posi- 
tive in 

( Pi "f" a ij) 

A — t 

bks + H bjes)5lca 

s 

then the pattern of signs in A~ x is 


+ 

.. + 

““ * ' 

» « 

+ 

•• + 

• 

■ 

• 

• 

+ 

+ 

• 

.. : 

+ •• 

+ 
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